Identification of the genetic determinants of the polymorphic CYP3A5 expression



The present invention relates to a polymorphic CYP3A5 polynucleotide. Moreover, the 
invention relates to genes or vectors comprising the polynucleotides of the invention and to
a host cell genetically engineered with the polynucleotide or gene of the invention. Further, 
the invention relates to methods for producing molecular variant polypeptides or fragments 
thereof, methods for producing cells capable of expressing a molecular variant polypeptide 
and to a polypeptide or fragment thereof encoded by the polynucleotide or the gene of the 
invention or which is obtainable by the method or from the cells produced by the method of 
the invention. Furthermore, the invention relates to an antibody which binds specifically the 
polypeptide of the invention. Moreover, the invention relates to a transgenic non-human 
animal. The invention also relates to a solid support comprising one or a plurality of the 
above mentioned polynucleotides, genes, vectors, polypeptides, antibodies or host cells. 
Furthermore, methods of identifying a polymorphism, identifying and obtaining a pro-drug
or drug or an inhibitor are also encompassed by the present invention. In addition, the
invention relates to methods for producing a pharmaceutical composition and to
methods of diagnosing a disease. Further, the invention relates to a method of detection of
the polynucleotide of the invention. Furthermore, comprised by the present invention are a
diagnostic and a pharmaceutical composition. Even more, the invention relates to uses of 
the polynucleotides, genes, vectors, polypeptides or antibodies of the invention. Finally, 
the invention relates to a diagnostic kit. 

The CYP3A enzymes play a particularly important role in drug metabolism. This is due to
their abundant expression in the liver combined with a broad substrate spectrum. Indeed, it 
is estimated that CYP3A isozymes collectively comprise the largest portion of the liver 
CYP protein (Thummel, Annu Rev Pharmacol Toxicol 38 (1998), 389-430) and that they 
are involved in the metabolism of 45 % - 60 % of all currently used drugs (Li, Toxicology 
104 (1995), 1-8; Evans, Science 286 (1999), 487-91). In addition to drugs, CYP3A 
isozymes metabolise a variety of other compounds including steroid hormones, toxins and 
carcinogens. For example, CYP3A isozymes metabolise aflatoxin B1 (Wang, Biochemistry
37 (1998), 12536-45; Gillam, Arch Biochem Biophys 317 (1995), 374-84; Li, Cancer Res
57 (1997), 641-5), a mycotoxin strongly implicated in the etiology of liver cancer, which is a
major cause of premature death in many areas of Africa and Asia (Henry, Science 286 
(1999), 2453-4). 

The hepatic expression and activity of CYP3A isozymes is inter-individually variable and 
this variability is the reason for harmful interactions frequently encountered in development 
and application of drugs that are CYP3A substrates. It has also been postulated that 
variable CYP3A expression could affect an individual's predisposition to cancers caused 
by environmental carcinogens which are metabolised by CYP3A. The elucidation of factors
controlling an individual's CYP3A activity could permit personalised dose adjustments in 
therapies with its substrates and also lead to the identification of sub-populations at 
increased risk for several common cancers. However, despite considerable efforts, our 
understanding of factors governing CYP3A activity and expression is limited. There are 
several reasons for this: An average human liver may express products of up to four 
CYP3A genes (Gellner, Pharmacogenetics 11 (2001), 111 - 121), but their respective 
contributions to the hepatic CYP3A pool are still a matter of debate. The differentiation 
between the individual CYP3A proteins by enzymatic methods has proven difficult due to 
overlapping substrate specificities and due to the considerable effect of reconstitution
conditions on their catalytic activities. RNA and protein analysis indicate that CYP3A4 
forms the bulk of the hepatic CYP3A protein and its expression is highly variable 
(Thummel, Annu Rev Pharmacol Toxicol 38 (1998), 389-430). Less well understood are 
the contributions of the other CYP3A genes. CYP3A5 is widely considered the second 
most important CYP3A protein in the liver, but the available data are conflicting, since its 
expression has been reported to be present in 10 % to 97 % of human livers (Aoyama, J 
Biol Chem 264 (1989), 10388-95; Wrighton, Mol Pharmacol 38 (1990), 207-13; Schuetz, 
Pharmacogenetics 4 (1994), 11-20; Jounaidi, Biochem Biophys Res Commun 221 (1996),
466-70; Boobis, Br J Clin Pharmacol 42 (1996), 81-9). The possible reasons for these 
discrepancies include small sample sizes, interethnic differences and poor specificity of 
probes used to measure CYP3A5 expression. The third CYP3A, CYP3A7, was originally 
described in the human fetal liver where it accounts for about 50 % of the total CYP protein 
(Wrighton, Biochem Pharmacol 37 (1988), 3053-5). More recent studies indicate 
constitutive or induced expression of CYP3A7 in adult human livers, but its quantification 
has been hampered by the lack of specific antibodies. Similarly, no protein expression
data are available for the recently identified fourth member of the family, CYP3A43
(Gellner, Pharmacogenetics 11 (2001), 111-121).



Clinical studies indicate that a major portion of the inter-individual CYP3A variability is 
caused by genetic factors (Ozdemir, Pharmacogenetics 10 (2000), 373-88), but the 
identities of the latter remain unknown. In respect of CYP3A5, a protein variant 
(Thr398Asn) has been found in 2 out of 5 individuals deficient in CYP3A5 expression
(Jounaidi, Biochem Biophys Res Commun 221 (1996), 466-70), but its significance has not 
been verified on a larger number of liver samples and in functional studies. In addition, a 
haplotype consisting of two linked polymorphisms has been described in the 5' flanking 
region of the CYP3A5 gene which is associated with increased expression and activity of 
the gene (Paulussen, Pharmacogenetics 10 (2000), 415-24). However, only a small 
sample set (n=29) was analysed for the genotype and the phenotype. Moreover, the single 
nucleotide polymorphisms (SNPs) which have been disclosed in said document are not 
suitable for a reliable prediction of CYP3A5 dysfunction and/or dysregulation and the 
problems caused thereby. This document does not suggest the existence of further 
haplotypes. 

Thus, improved means and methods for diagnosing and treating a variety of diseases and 
disorders based on dysfunctions or dysregulations of drug metabolism were not available 
yet but are nevertheless highly desirable. Thus, the technical problem underlying the 
present invention is to comply with the above specified needs. 

The solution to this technical problem is achieved by providing the embodiments 
characterized in the claims. 

Accordingly, the present invention relates to a polynucleotide comprising a polynucleotide
selected from the group consisting of:

(a) a polynucleotide having the nucleic acid sequence of SEQ ID NO: 54, 56, 58, 
62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 
102, 106, 108, 110, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 
123, 124, 125, 126, 128, 129, 130, 131, 133, 134, 135, 136, 137, 138, 139, 
140, 142, 143, 149, 151, 153, 155, 157, 159, 161, 163, 165, 169, 171, 173, 
176, 177, 179, 181, 183, 185, 187, 189, 193, 195, 197, 199, 201, 207, 208, 
209, 210, 211, 212, 213, 214, 216, 218, 219, 220, 221, 222, 223, 224, 225,
226, 227, 228, 231, 232, 233, 235, or 236;

(b) a polynucleotide encoding a polypeptide having the amino acid sequence of 
SEQ ID NO: 127, 132, 141, 215, 229, or 234; 

(c) a polynucleotide capable of hybridizing to a CYP3A5 gene, wherein said 
polynucleotide is having a nucleotide exchange, a nucleotide deletion of at 
least one nucleotide, or at least one additional nucleotide at a position 
corresponding to position -20643, -20555, -20359, -20367, -20329, -20323, 
-20310, -6200, -6177, -4336, -3990, -3868, -3844, -3557, -1617, -795, -86, 
-74, 136, 174 to 176, 230, 3705, 3709/3710, 5215, 5235, 5516, 7182, 7207,
7303, 7424/7427, 12907, 13028, 13077, 13173, 13226, 13376, 14720, 
14836, 14903, 15788, 16079, 16931/16932, 16993, 17163, 19069, 19165, 
19208, 27050, 27131/27132, 27526, 31499, 31551 or 31611 of the CYP3A5 
gene (Accession No: AF280107.1, wherein position 166220 has been 
numbered +1 and position 174832 has been numbered +8613, and 
Accession No: AC005020.2, wherein position 27341 has been numbered 
+8614); 

(d) a polynucleotide capable of hybridizing to a CYP3A5 gene, wherein said 
polynucleotide is having an A at a position corresponding to position -20555,
-20329, -20323, -4336, -3868, -3844, -795, -86, 230, 5235, 5516, 7182, 
7303, 12907, 13028, 13376, 19069 or 19165 of the CYP3A5 gene 
(Accession No: AF280107.1, wherein position 166220 has been numbered 
+1 and position 174832 has been numbered +8613, and Accession No:
AC005020.2, wherein position 27341 has been numbered +8614), a T at a
position corresponding to position -20367, -6200, -74, 3705, 5215, 7207, 
14836, 17163, 19208 or 27526 of the CYP3A5 gene (Accession No: 
AF280107.1, wherein position 166220 has been numbered +1 and position 
174832 has been numbered +8613, and Accession No: AC005020.2, 
wherein position 27341 has been numbered +8614), a G at a position 
corresponding to position -6177, -3990, 13077, 14720, 14903, 16993 or 
27050 of the CYP3A5 gene (Accession No: AF280107.1, wherein position
166220 has been numbered +1 and position 174832 has been numbered 
+8613, and Accession No: AC005020.2, wherein position 27341 has been 
numbered +8614), a C at a position corresponding to position -20643, 
-20310, -3557, -1617, 136, 13173, 13226, 15788, 16079, 31499, 31551 or 
31611 of the CYP3A5 gene (Accession No: AF280107.1, wherein position
166220 has been numbered +1 and position 174832 has been numbered 
+8613, and Accession No: AC005020.2, wherein position 27341 has been 
numbered +8614), nucleotide deletions at positions corresponding to 
positions 174 to 176 of the CYP3A5 gene (Accession No: AF280107.1, 
wherein position 166220 has been numbered +1 and position 174832 has 
been numbered +8613), an additional nucleotide at a position corresponding 
to position 3709/3710 or 27131/27132 of the CYP3A5 gene (Accession No: 
AF280107.1, wherein position 166220 has been numbered +1 and position 
174832 has been numbered +8613, and Accession No: AC005020.2, 
wherein position 27341 has been numbered +8614), three additional
nucleotides at a position corresponding to position 16931/16932 of the 
CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 has 
been numbered +1 and position 174832 has been numbered +8613, and 
Accession No: AC005020.2, wherein position 27341 has been numbered 
+8614), or a deletion of two nucleotides and nine additional nucleotides 
inserted at a position corresponding to position 7424 to 7427 of the CYP3A5 
gene (Accession No: AF280107.1, wherein position 166220 has been 
numbered +1 and position 174832 has been numbered +8613); 

(e) a polynucleotide encoding a CYP3A5 polypeptide or fragment thereof, 
wherein said polypeptide comprises an amino acid substitution at a position 
corresponding to position 30, 100, 130, 149 or 488 of the CYP3A5 
polypeptide (Accession No: NP_000768.1), or at least one amino acid
exchange or a stop codon at a position corresponding to position 30 to 34 or
346 to 348 of the CYP3A5 polypeptide (Accession No: NP_000768.1); and

(f) a polynucleotide encoding a CYP3A5 polypeptide or fragment thereof, 
wherein said polypeptide comprises amino acid substitutions of HGLFK to 
YGTF. (with the period meaning termination) at a position corresponding to 
position 30 to 34 of the CYP3A5 polypeptide (Accession No: NP_000768.1),
an amino acid substitution of S to Y at a position corresponding to position 
100 of the CYP3A5 polypeptide (Accession No: NP_000768.1), an amino 
acid substitution of R to Q at a position corresponding to position 130 of the 
CYP3A5 polypeptide (Accession No: NP_000768.1), an amino acid 
substitution of I to T at a position corresponding to position 149 of the 
CYP3A5 polypeptide (Accession No: NP_000768.1), amino acid
substitutions of TYD to YL. (with the period meaning termination) at a position
corresponding to position 346 to 348 of the CYP3A5 polypeptide (Accession 
No: NP_000768.1), or an amino acid substitution of I to T at a position 
corresponding to position 488 of the CYP3A5 polypeptide (Accession No: 
NP_000768.1).

In the context of the present invention the term "polynucleotides" or the term "polypeptides" 
refers to different variants of a polynucleotide or polypeptide. Said variants comprise a 
reference or wild type sequence of the polynucleotides or polypeptides of the invention as 
well as variants which differ therefrom in structure or composition. Reference or wild type 
sequences for the polynucleotides are Accession No: AF280107.1 and AC005020.2. 
Reference or wild type sequence for the polypeptides of the invention is Accession No: 
NP_000768.1. The differences in structure or composition usually occur by way of 
nucleotide or amino acid substitution(s), addition(s) and/or deletion(s). Preferably, said
nucleotide substitution(s), addition(s) or deletion(s) result(s) in one or more changes of the 
corresponding amino acid(s) of the polypeptides of the invention. The variant
polynucleotides and polypeptides also comprise fragments of said polynucleotides or 
polypeptides of the invention. The polynucleotides and polypeptides as well as the 
aforementioned fragments thereof of the present invention are characterized as being 
associated with a CYP3A5 dysfunction or dysregulation. Preferably, said dysfunctions or 
dysregulations referred to in the present invention cause a disease or disorder or a 
prevalence for said disease or disorder. Preferably, as will be discussed below in detail, 
said disease is cancer or diseases including cardiovascular diseases, diabetes and AIDS 
or any other disease caused by a dysfunction or dysregulation due to a polynucleotide or
polypeptides of the invention. 

The polynucleotides of the invention include polynucleotides that have at least 70%, 
preferably at least 75%, at least 80%, at least 85%, at least 90% or at least 95% sequence 
identity to a CYP3A5 gene, wherein said polynucleotide is having a nucleotide exchange, 
a nucleotide deletion of at least one nucleotide, or at least one additional nucleotide at a 
position corresponding to position -20643, -20555, -20359, -20367, -20329, -20323,
-20310, -6200, -6177, -4336, -3990, -3868, -3844, -3557, -1617, -795, -86, -74, 136, 174
to 176, 230, 3705, 3709/3710, 5215, 5235, 5516, 7182, 7207, 7303, 7424/7427, 12907, 
13028, 13077, 13173, 13226, 13376, 14720, 14836, 14903, 15788, 16079, 16931/16932, 
16993, 17163, 19069, 19165, 19208, 27050, 27131/27132, 27526, 31499, 31551 or 31611 

WO 02/053775 PCT/EP01/15290

of the CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 has been 
numbered +1 and position 174832 has been numbered +8613, and Accession No: 
AC005020.2, wherein position 27341 has been numbered +8614). 

The term "hybridizing" as used herein refers to polynucleotides which are capable of 
hybridizing to the polynucleotides of the invention or parts thereof which are associated 
with a CYP3A5 dysfunction or dysregulation. Thus, said hybridizing polynucleotides are 
also associated with said dysfunctions and dysregulations. Therefore, said polynucleotides 
may be useful as probes in Northern or Southern Blot analysis of RNA or DNA 
preparations, respectively, or can be used as oligonucleotide primers in PCR analysis
dependent on their respective size. Also comprised by the invention are hybridizing 
polynucleotides which are useful for analysing DNA-Protein interactions via, e.g., 
electrophoretic mobility shift analysis (EMSA). Preferably, said hybridizing polynucleotides 
comprise at least 10, more preferably at least 15 nucleotides in length while a hybridizing 
polynucleotide of the present invention to be used as a probe preferably comprises at least 
100, more preferably at least 200, or most preferably at least 500 nucleotides in length. 
It is well known in the art how to perform hybridization experiments with nucleic acid 
molecules, i.e. the person skilled in the art knows what hybridization conditions s/he has to 
use in accordance with the present invention. Such hybridization conditions are referred to 
in standard text books such as Molecular Cloning A Laboratory Manual, Cold Spring 
Harbor Laboratory (1989) N.Y. Preferred in accordance with the present inventions are 
polynucleotides which are capable of hybridizing to the polynucleotides of the invention or 
parts thereof which are associated with a CYP3A5 dysfunction or dysregulation under 
stringent hybridization conditions, i.e. which do not cross hybridize to unrelated
polynucleotides such as polynucleotides encoding a polypeptide different from the 
CYP3A5 polypeptides of the invention. 

Nucleic acid hybridization will be affected by such conditions as salt concentration, 
temperature, or organic solvents, in addition to the base composition, length of the 
complementary strands and the number of nucleotide base mismatches between the 
hybridizing nucleic acids, as will be readily appreciated by those skilled in the art. Stringent 
temperature conditions will generally include temperatures in excess of 30°C, typically
37°C, and preferably in excess of 45°C. Stringent salt conditions will ordinarily be less than 
1000 mM, typically less than 500 mM and preferably less than 200 mM. However, the 
combination of parameters is much more important than the measure of any single 
parameter. See, e.g., Wetmur and Davidson, 1968. Probe sequences may also hybridize 
specifically to duplex DNA under certain conditions to form triplex or higher order DNA
complexes. The preparation of such probes and suitable hybridization conditions are well 
known in the art. 

The term "percent sequence identity" or "identical" in the context of nucleic acid sequences 
refers to the residues in the two sequences which are the same when aligned for 
maximum correspondence. The length of sequence identity comparison may be over a 
stretch of at least nine nucleotides, usually at least 20 nucleotides, more usually at least 24
nucleotides, typically at least 28 nucleotides, more typically at least 32 nucleotides, and 
preferably at least 36 nucleotides or more nucleotides. There are a number of different 
algorithms known in the art which can be used to measure nucleotide sequence identity. 
For instance, polynucleotide sequences can be compared using Fasta, a program in GCG 
Version 6.1. Fasta provides alignments and percent sequence identity of the regions of the
best overlap between the query and the search sequence (Pearson, 1980, herein 
incorporated by reference). For instance, percent sequence identity between nucleic acid 
sequences can be determined using Fasta with its default parameters (a word size of 6 
and the NOPAM factor for the scoring matrix) as provided in GCG Version 6.1, herein
incorporated by reference. 
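The notion of percent sequence identity described above can be illustrated with a minimal sketch. It is not the Fasta algorithm and does not perform an alignment; it assumes the two sequences have already been aligned (gaps written as "-") and simply counts identical columns. The function name `percent_identity` and the gap convention are assumptions made for this illustration only.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of compared alignment columns that carry the same residue.

    Both inputs must already be aligned to equal length. Columns in which
    both sequences carry a gap are skipped; a gap opposite a residue is
    counted as a mismatch (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # column holds no residue in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared
```

For example, two aligned 10-mers differing at one position give 90 % identity, which would satisfy the "at least 90 %" threshold but not the "at least 95 %" threshold mentioned above.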

The term "corresponding" as used herein means that a position is not only determined by 
the number of the preceding nucleotides and amino acids, respectively. The position of a 
given nucleotide or amino acid in accordance with the present invention which may be 
deleted, substituted or comprise one or more additional nucleotide(s) may vary due to 
deletions or additional nucleotides or amino acids elsewhere in the gene or the
polypeptide. Thus, under a "corresponding position" in accordance with the present 
invention it is to be understood that nucleotides or amino acids may differ in the indicated 
number but may still have similar neighboring nucleotides or amino acids. Said nucleotides 
or amino acids which may be exchanged, deleted or comprise additional nucleotides or 
amino acids are also comprised by the term "corresponding position". Said nucleotides or 
amino acids may for instance together with their neighbors form sequences which may be 
involved in the regulation of gene expression, stability of the corresponding RNA or RNA 
editing, as well as encode functional domains or motifs of the protein of the invention. 

By, e.g., "position 3709/3710" it is meant that said polynucleotide comprises one or more 
additional nucleotide(s) which are inserted between positions 3709 and position 3710 of 
the corresponding wild type version of said polynucleotide. The same applies mutatis
mutandis to all other position numbers referred to in the above embodiment which are 
drafted in the same format, i.e. two consecutive position numbers separated by a slash (/). 
By, e.g., "position 7424 to 7427" it is meant that said polynucleotide comprises one or more
deleted nucleotides which are deleted between positions 7424 and position 7427 of the 
corresponding wild type version of said polynucleotide and/or one or more additional 
nucleotide(s) which are inserted between positions 7424 and position 7427 of the
corresponding wild type version of said polynucleotide. The same applies mutatis 
mutandis to all other position numbers referred to in the above embodiment which are 
drafted in the same format. 
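The position formats explained above can be sketched as a small parser. This is an illustrative helper only: the names `VariantSite` and `parse_site` are hypothetical, and treating a slash between two non-consecutive numbers (such as "7424/7427" in the listing above) as a range is an assumption of this sketch.

```python
import re
from typing import NamedTuple


class VariantSite(NamedTuple):
    kind: str   # "single", "insertion_site" or "range"
    start: int
    end: int


def parse_site(label: str) -> VariantSite:
    """Parse labels such as "-20643", "3709/3710" or "174 to 176"."""
    text = label.strip()
    m = re.fullmatch(r"([+-]?\d+)\s*(/|to)\s*([+-]?\d+)", text)
    if m is None:
        pos = int(text)  # raises ValueError for anything that is not a number
        return VariantSite("single", pos, pos)
    start, sep, end = int(m.group(1)), m.group(2), int(m.group(3))
    if start >= end:
        raise ValueError("start must be smaller than end: " + label)
    if sep == "/" and end - start == 1:
        # two consecutive positions: additional nucleotide(s) inserted between them
        return VariantSite("insertion_site", start, end)
    # a span within which nucleotides are deleted and/or inserted
    return VariantSite("range", start, end)
```

With these assumptions, `parse_site("3709/3710")` yields an insertion site between positions 3709 and 3710, while `parse_site("174 to 176")` yields a range.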

The numbering of the polymorphisms refers to the aligned and joined genomic sequences 
AF280107.1 and AC005020.2, wherein the T at position 174832 (which has been 
numbered +8613) of the sequence AF280107.1 refers to position 27340 of the sequence 
AC005020.2. The nucleotide A at position 27341 of the sequence AC005020.2 has been
numbered +8614. Numbering of polymorphisms to a position corresponding to a position
up to +8613 refers to the genomic sequence AF280107.1, numbering of polymorphisms to
a position corresponding to position +8614 and greater refers to the genomic sequence
AC005020.2. 
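The numbering scheme described above amounts to a simple coordinate conversion, sketched here under stated assumptions. The three anchor values are taken from the text; the function names are hypothetical. The text does not state how positions upstream of +1 are counted, so the `skip_zero` parameter (no position 0, i.e. the nucleotide immediately 5' of +1 is -1) is an assumption of this sketch and should be checked against the sequence listing.

```python
from typing import Tuple

AF_PLUS_ONE = 166220  # AF280107.1 position numbered +1
AF_LAST = 174832      # AF280107.1 position numbered +8613
AC_FIRST = 27341      # AC005020.2 position numbered +8614


def af_to_gene(pos: int, skip_zero: bool = True) -> int:
    """Convert an AF280107.1 coordinate to the gene-relative numbering."""
    if pos > AF_LAST:
        raise ValueError("beyond +8613; use AC005020.2 coordinates")
    if pos >= AF_PLUS_ONE:
        return pos - AF_PLUS_ONE + 1
    offset = pos - AF_PLUS_ONE  # negative for upstream positions
    return offset if skip_zero else offset + 1  # assumption: no position 0


def ac_to_gene(pos: int) -> int:
    """Convert an AC005020.2 coordinate (27341 or greater) to gene numbering."""
    if pos < AC_FIRST:
        raise ValueError("positions before 27341 are numbered via AF280107.1")
    return pos - AC_FIRST + 8614


def gene_to_accession(gene_pos: int, skip_zero: bool = True) -> Tuple[str, int]:
    """Return (accession, coordinate) for a gene-relative position."""
    if gene_pos >= 8614:
        return ("AC005020.2", gene_pos - 8614 + AC_FIRST)
    if gene_pos >= 1:
        return ("AF280107.1", gene_pos + AF_PLUS_ONE - 1)
    if skip_zero:
        if gene_pos == 0:
            raise ValueError("position 0 does not exist when skip_zero is True")
        return ("AF280107.1", AF_PLUS_ONE + gene_pos)
    return ("AF280107.1", AF_PLUS_ONE - 1 + gene_pos)
```

The anchors are mutually consistent: 174832 - 166220 + 1 = 8613, so position +8614 is the first nucleotide taken from AC005020.2.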

In accordance with the present invention, the mode and population distribution of genetic 
variations in the CYP3A5 gene has been analyzed by sequence analysis of relevant 
regions of said human gene from many different individuals. It is a well known fact that
genomic DNA of individuals, which harbors the individual genetic makeup of all genes,
including the CYP3A5 gene, can easily be purified from individual blood samples. These 
individual DNA samples are then used for the analysis of the sequence composition of the 
alleles of the CYP3A5 gene that are present in the individual which provided the blood 
sample. The sequence analysis was carried out by PCR amplification of relevant regions
of said genes, subsequent purification of the PCR products, followed by automated DNA
sequencing with established methods (e.g. ABI dye terminator cycle sequencing).
One important parameter that had to be considered in the attempt to determine the
individual genotypes and identify novel variants of the CYP3A5 gene by direct DNA
sequencing of PCR products from human blood genomic DNA is the fact that each human
harbors (usually, with very few abnormal exceptions) two gene copies of each autosomal 
gene (diploidy). Because of that, great care had to be taken in the evaluation of the 
sequences to be able to identify unambiguously not only homozygous sequence variations 
but also heterozygous variations. The details of the different steps in the identification and
characterization of novel polymorphisms in the CYP3A5 gene (homozygous and 
heterozygous) are described in the Examples below. 

Over the past 20 years, genetic heterogeneity has been increasingly recognized as a 
significant source of variation in drug response. Many scientific communications (Meyer, 
Ann. Rev. Pharmacol. Toxicol. 37 (1997), 269-296 and West, J. Clin. Pharmacol. 37 
(1997), 635-648) have clearly shown that some drugs work better or may even be highly 
toxic in some patients than in others and that these variations in patients' responses to
drugs can be related to a molecular basis. This "pharmacogenomic" concept spots
correlations between responses to drugs and genetic profiles of patients (Marshall, Nature
Biotechnology, 15 (1997), 954-957; Marshall, Nature Biotechnology, 15 (1997), 1249-
1252). In this context of population variability with regard to drug therapy,
pharmacogenomics has been proposed as a tool useful in the identification and selection
of patients who can respond to a particular drug without side effects. This
identification/selection can be based upon molecular diagnosis of genetic polymorphisms
by genotyping DNA from leukocytes in the blood of a patient, for example, and
characterization of disease (Bertz, Clin. Pharmacokinet. 32 (1997), 210-256; Engel, J.
Chromatogr. B. Biomed. Appl. 678 (1996), 93-103). For the funders of health care, such
as health maintenance organizations in the US and government public health services in 
many European countries, this pharmacogenomics approach can represent a way of both 
improving health care and reducing overheads because there is a large cost to 
unnecessary drugs, ineffective drugs and drugs with side effects. 

The mutations in the variant genes of the invention sometimes result in amino acid
deletion(s), insertion(s) and in particular in substitution(s) either alone or in combination. It 
is of course also possible to genetically engineer such mutations in wild type genes or 
other mutant forms. Methods for introducing such modifications in the DNA sequence of
said genes are well known to the person skilled in the art; see, e.g., Sambrook, Molecular 
Cloning A Laboratory Manual, Cold Spring Harbor Laboratory (1989) N.Y. 

For the investigation of the nature of the alterations in the amino acid sequence of the 
polypeptides of the invention software programs may be used such as RASMOL that are 
obtainable from the Internet. Furthermore, folding simulations and computer redesign of 
structural motifs can be performed using other appropriate computer programs (Olszewski, 
Proteins 25 (1996), 286-299; Hoffman, Comput. Appl. Biosci. 11 (1995), 675-679).
Computers can be used for the conformational and energetic analysis of detailed protein
models (Monge, J. Mol. Biol. 247 (1995), 995-1012; Renouf, Adv. Exp. Med. Biol. 376
(1995), 37-45). These analyses can be used for the identification of the influence of a
particular mutation on binding and/or processing of drugs. 

Usually, said amino acid deletion, addition or substitution in the amino acid sequence of 
the protein encoded by the polynucleotide of the invention is due to one or more nucleotide 
substitution, insertion or deletion, or any combinations thereof. Preferably said nucleotide 
substitution, insertion or deletion may result in amino acid substitutions of HGLFK to 
YGTF. (with the period meaning termination) at a position corresponding to position 30 to 
34 of the CYP3A5 polypeptide (Accession No: NP_000768.1), in an amino acid
substitution of S to Y at a position corresponding to position 100 of the CYP3A5 
polypeptide (Accession No: NP_000768.1), in amino acid substitutions of TYD to YL. (with 
the period meaning termination) at a position corresponding to position 346 to 348 of the 
CYP3A5 polypeptide (Accession No: NP_000768.1), or in an amino acid substitution of T 
to N at a position corresponding to position 398 of the CYP3A5 polypeptide (Accession 
No: NP_000768.1).

The mutations in the CYP3A5 gene detected in accordance with the present invention are 
listed in Table 2A-E. The methods of the mutation analysis followed standard protocols 
and are described in detail in the Examples. In general such methods are to be used in 
accordance with the present invention for evaluating the phenotypic spectrum as well as 
the overlapping clinical characteristics of diseases or conditions related to dysfunctions
and diseases related to the drug metabolism. Advantageously, the characterization of said 
mutants may form the basis of the development of improved drugs, such as drugs which 
are used e.g. in cancer therapy and diseases including cardiovascular diseases, diabetes 
and AIDS. Said methods encompass for example haplotype analysis, single-strand 
conformation polymorphism analysis (SSCA), PCR and direct sequencing, or TaqMan®
analysis. On the basis of thorough clinical characterization of many patients the 
phenotypes can then be correlated to these mutations as well as to mutations that had 
been described earlier, for example in Jounaidi, Biochem Biophys Res Commun, 221, pp.
466-470, 1996.

Also comprised by the polynucleotides referred to in the present invention are
polynucleotides which comprise at least two, preferably at least three, of the 
polynucleotides specified hereinabove, i.e. polynucleotides having a nucleotide sequence 
which contains at least two, preferably three of the mutations comprised by the above 
polynucleotides or listed in the tables below. Thus, the haplotype determined in 
accordance with the present invention can be characterized by at least two, preferably 
three of said mutations in the CYP3A5 locus. Further, the polynucleotide of the invention 
may further comprise at least one nucleotide deletion, addition and/or substitution other 
than those specified hereinabove, for example those described in the prior art; e.g., in 
Jounaidi, Biochem Biophys Res Commun, 221, pp. 466-470, 1996, in Paulussen,
Pharmacogenetics 10, pp. 415-424, 2000, in Kuehl, 2001, Nature Genetics 27: 383-391, or
in Chou, 2001, Drug Metab Dispos 29: 1205-1209. 

This allows the study of synergistic effects of said mutations in the CYP3A5 gene and/or a 
polypeptide encoded by said polynucleotide on the pharmacological profile of drugs in 
patients who bear such mutant forms of the gene or similar mutant forms that can be 
mimicked by the above described proteins. It is expected that the analysis of said 
synergistic effects provides deeper insights into the onset of dysfunctions or diseases 
related to drug metabolism as described supra. From said deeper insight the development 
of diagnostic and pharmaceutical compositions related to dysfunctions or diseases related 
to drug metabolism will greatly benefit. 

Moreover, it has been surprisingly found that the so called positive predictive power for 
CYP3A5 dysfunctions or dysregulations can be significantly increased based on the 
polynucleotides of the present invention and thus allows a reliable prediction in contrast to 
positive predictive power based on the prior art. The increased CYP3A5 protein 
expression in all but one of the liver samples (17/18) identified in accordance with the present
invention and described in detail in the examples below co-segregates with a haplotype 
which consists of at least three variants (ch-v-021, ch-v-026, ch-v-015) with distinct 
locations within or upstream of the gene locus. Genotyping these three variants has in no 
case led to the generation of false-positive predictions resulting in an estimated positive 
predictive power for the 3-variant genotype of about 99.95 %. This is in striking contrast to 
the positive predictive power determined for the haplotype described by Paulussen, 
Pharmacogenetics 10, pp. 415-424, 2000, which is about 65 %. Moreover, based on the 
polynucleotides of the invention and as described in the examples below, it has been 
found that the SNPs described by Paulussen, Pharmacogenetics 10, pp. 415-424, 2000 
are located, contrary to what is reported in said document, approximately 20 kb upstream 
of the transcriptional start site of the CYP3A5 gene in a sequence 5' to a CYP3A5 
pseudogene locus. 
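
By way of illustration only, the positive predictive power referred to above can be 
understood as the fraction of positive genotype-based predictions that are confirmed by 
the phenotype. The following minimal sketch assumes this standard definition; the function 
name and the counts are hypothetical and are not taken from the examples. It does not 
reproduce the estimate of about 99.95 %, the derivation of which is not stated in this 
section. 

```python
def positive_predictive_value(true_positives: int, false_positives: int) -> float:
    """Fraction of positive predictions that turn out to be correct."""
    total_positive_calls = true_positives + false_positives
    if total_positive_calls == 0:
        raise ValueError("no positive predictions were made")
    return true_positives / total_positive_calls


# Hypothetical counts, chosen only to illustrate the arithmetic.
# A marker with no false-positive calls gives a point estimate of 1.0.
ppv_without_false_positives = positive_predictive_value(17, 0)

# A marker for which 7 of 20 positive calls are wrong gives 0.65.
ppv_with_false_positives = positive_predictive_value(13, 7)

print(ppv_without_false_positives, ppv_with_false_positives)
```

A marker without false-positive calls yields a point estimate of 100 %, whereas each 
false-positive call lowers the value in proportion to the total number of positive calls. 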

Therefore, the haplotypes characterized on the basis of the polynucleotides of the present 
invention fulfil the criteria expected from a reliable marker of CYP3A5 expression. As is 
evident to the person skilled in the art, the genetic knowledge deduced from the present 
invention can now be used to exactly and reliably characterize the genotype of a patient. 
Advantageously, diseases or a prevalence for a disease which are associated with 
CYP3A5 dysfunction or dysregulation, such as cancer, diseases including cardiovascular 
diseases, diabetes and AIDS, can be predicted and preventive or therapeutical measures 
can be applied accordingly. Moreover in accordance with the foregoing, in cases where a 
given drug takes an unusual effect, a suitable individual therapy can be designed based on 
the knowledge of the individual genetic makeup of a subject with respect to the 
polynucleotides of the invention and improved therapeutics can be developed as will be 
further discussed below. 

Finally, the polynucleotides and polypeptides referred to in accordance with the present 
invention are also useful as forensic markers, which improve the identification of subjects 
who have been killed, for example, by a crime of violence or any other violence and who 
cannot be identified by the well known conventional forensic methods. The 
application of forensic methods based on the detection of the polymorphisms comprised 
by the polynucleotides of this invention in the genome of a subject is particularly well 
suited in cases where a (dead) body is disfigured in a severe manner such that 
identification by other body characteristics such as the features of the face is not possible. 
This is the case, for example, for corpses found in water, which are usually entirely 
disfigured. Advantageously, methods which are based on the provision of the 
polynucleotides of the invention merely require a minimal amount of tissues or cells in 
order to be carried out. Said tissues or cells may be blood droplets, hair roots, epidermal 
scales, saliva droplets, sperm etc. Since only such a minimal amount of tissues or cells 
is required for the identification of a subject, the polymorphisms comprised by the 
polynucleotides of this invention can also be used as forensic markers in order to prove 
someone guilty of a crime, such as a sexual assault or a rape. Moreover, the 
polymorphisms comprised by the polynucleotides of this invention can be used to prove 
paternity. In accordance with the forensic methods referred to herein the presence or 
absence of the polynucleotides of the invention is determined and compared with a 
reference sample which is unambiguously derived from the subject to be identified. The 
forensic methods which require detection of the presence or absence of the 
polynucleotides of the invention, or of the polymorphisms comprised by them, in a sample 
of a subject can be, for example, PCR-based techniques which are 
particularly well suited in cases where only a minimal amount of tissues or cells are 
available as forensic samples. On the other hand, where enough tissue or cells are 
available, hybridization based techniques may be performed in order to detect the 
presence or absence of a polynucleotide of this invention. These techniques are well 
known by the person skilled in the art and can be adapted to the individual purposes 
referred to herein without further ado. In conclusion, thanks to the present invention, 
forensic means which allow improved and reliable predictions as regards the 
aforementioned aspects are now available. 

In line with the foregoing, preferably, the polynucleotide of the present invention is 
associated with cancer or diseases including cardiovascular diseases, diabetes and AIDS. 

The term "cancer" used herein is very well known and characterized in the art. Several 
variants of cancer exist and are comprised by said term as meant in accordance with the 
invention. For a detailed list of symptoms which are indicative for cancer, reference is made to 
text book knowledge, e.g. Pschyrembel. 

In a further embodiment the present invention relates to a polynucleotide which is DNA or 
RNA. 

The polynucleotide of the invention may be, e.g., DNA, cDNA, genomic DNA, RNA or 
synthetically produced DNA or RNA or a recombinantly produced chimeric nucleic acid 
molecule comprising any of those polynucleotides either alone or in combination. 
Preferably said polynucleotide is part of a vector, particularly plasmids, cosmids, viruses 
and bacteriophages used conventionally in genetic engineering that comprise a 
polynucleotide of the invention. Such vectors may comprise further genes such as marker 
genes which allow for the selection of said vector in a suitable host cell and under suitable 
conditions. 

The invention furthermore relates to a gene comprising the polynucleotide of the invention. 

It is well known in the art that genes comprise structural elements which encode an amino 
acid sequence as well as regulatory elements which are involved in the regulation of the 
expression of said genes. Structural elements are represented by exons which may either 
encode an amino acid sequence or which may encode for RNA which is not encoding an 
amino acid sequence but is nevertheless involved in RNA function, e.g. by regulating the 
stability of the RNA or the nuclear export of the RNA. 

Regulatory elements of a gene may comprise promoter elements or enhancer elements 
both of which could be involved in transcriptional control of gene expression. It is very well 
known in the art that a promoter is to be found upstream of the structural elements of a 
gene. Regulatory elements such as enhancer elements, however, may be found 
distributed over the entire locus of the gene. Said elements could reside, e.g., in 
introns, regions of genomic DNA which separate the exons of a gene. Said introns may 
comprise further regulatory elements which are required for proper gene expression. 
Introns are usually transcribed together with the exons of a gene resulting in a nascent 
RNA transcript which contains both exon and intron sequences. The intron-encoded RNA 
sequences are usually removed by a process known as RNA splicing. However, said 
process also requires regulatory sequences present on an RNA transcript; said regulatory 
sequences may be encoded by the introns. 

In addition, besides their function in transcriptional control and control of proper RNA 
processing and/or stability, regulatory elements of a gene could be also involved in the 
control of genetic stability of a gene locus. Said elements control, e.g., recombination 
events or serve to maintain a certain structure of the DNA or the arrangement of DNA in a 
chromosome. 

Therefore, polymorphisms can occur in exons of a gene which encode an amino acid 
sequence as discussed supra as well as in regulatory regions which are involved in the 
above discussed process. The analysis of the nucleotide sequence of a gene locus in its 
entirety including, e.g., introns is in light of the above desirable. It has been found based 
on the polymorphisms comprised by the polynucleotides of the present invention that the 
mechanism of the increased expression of CYP3A5 protein in most Caucasian livers 
described in the examples below may involve enhanced transcription and stabilisation of 
the gene's transcripts. 

Therefore, in a further preferred embodiment of the gene of the invention a nucleotide 
deletion, addition and/or substitution results in altered expression of the variant gene 
compared to the corresponding wild type gene. 

In another embodiment the present invention relates to a vector comprising the 
polynucleotide of the invention or the gene of the invention. 

Said vector may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral 
vectors may be replication competent or replication defective. In the latter case, viral 
propagation generally will occur only in complementing host cells. 

The polynucleotides or genes of the invention may be joined to a vector containing 
selectable markers for propagation in a host. Generally, a plasmid vector is introduced in a 
precipitate such as a calcium phosphate precipitate, or in a complex with a charged lipid or 
in carbon-based clusters. Should the vector be a virus, it may be packaged in vitro using 
an appropriate packaging cell line prior to application to host cells. 

In a more preferred embodiment of the vector of the invention the polynucleotide is 
operatively linked to expression control sequences allowing expression in prokaryotic or 
eukaryotic cells or isolated fractions thereof. 

Expression of said polynucleotide comprises transcription of the polynucleotide, preferably 
into a translatable mRNA. Regulatory elements ensuring expression in eukaryotic cells, 
preferably mammalian cells, are well known to those skilled in the art. They usually 
comprise regulatory sequences ensuring initiation of transcription and optionally poly-A 
signals ensuring termination of transcription and stabilization of the transcript. Additional 
regulatory elements may include transcriptional as well as translational enhancers. 
Possible regulatory elements permitting expression in prokaryotic host cells comprise, e.g., 
the lac, trp or tac promoter in E. coli, and examples for regulatory elements permitting 
expression in eukaryotic host cells are the AOX1 or GAL1 promoter in yeast or the CMV-, 
SV40-, RSV-promoter (Rous sarcoma virus), CMV-enhancer, SV40-enhancer or a globin 
intron in mammalian and other animal cells. Besides elements which are responsible for the 
initiation of transcription, such regulatory elements may also comprise transcription 
termination signals, such as the SV40-poly-A site or the tk-poly-A site, downstream of the 
polynucleotide. In this context, suitable expression vectors are known in the art such as 
Okayama-Berg cDNA expression vector pcDV1 (Pharmacia), pCDM8, pRc/CMV, pcDNA1, 
pcDNA3 (Invitrogen), pSPORT1 (GIBCO BRL). Preferably, said vector is an expression 
vector and/or a gene transfer or targeting vector. Expression vectors derived from viruses 
such as retroviruses, vaccinia virus, adeno-associated virus, herpes viruses, or bovine 
papilloma virus, may be used for delivery of the polynucleotides or vector of the invention 
into a targeted cell population. Methods which are well known to those skilled in the art can 
be used to construct recombinant viral vectors; see, for example, the techniques described 
in Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory 
(1989) N.Y. and Ausubel, Current Protocols in Molecular Biology, Greene Publishing 
Associates and Wiley Interscience, N.Y. (1994). Alternatively, the polynucleotides and 
vectors of the invention can be reconstituted into liposomes for delivery to target cells. 

The term "isolated fractions thereof" refers to fractions of eukaryotic or prokaryotic cells or 
tissues comprising said cells which are capable of transcribing or transcribing and 
translating RNA from the vector of the invention. Said fractions comprise proteins which 
are required for transcription of RNA or transcription of RNA and translation of said RNA 
into a polypeptide. Said isolated fractions may be, e.g., nuclear and cytoplasmic fractions 
of eukaryotic cells such as of reticulocytes. 

The present invention furthermore relates to a host cell genetically engineered with the 
polynucleotide of the invention, the gene of the invention or the vector of the invention. 

Said host cell may be a prokaryotic or eukaryotic cell; see supra. The polynucleotide or 
vector of the invention which is present in the host cell may either be integrated into the 
genome of the host cell or it may be maintained extrachromosomally. In this respect, it is 
also to be understood that the recombinant DNA molecule of the invention can be used for 
"gene targeting" and/or "gene replacement", for restoring a mutant gene or for creating a 
mutant gene via homologous recombination; see for example Mouellic, Proc. Natl. Acad. 
Sci. USA, 87 (1990), 4712-4716; Joyner, Gene Targeting, A Practical Approach, Oxford 
University Press. 

The host cell can be any prokaryotic or eukaryotic cell, such as a bacterial, insect, fungal, 
plant, animal or human cell. Preferred fungal cells are, for example, those of the genus 
Saccharomyces, in particular those of the species S. cerevisiae. The term "prokaryotic" is 
meant to include all bacteria which can be transformed or transfected with a polynucleotide 
for the expression of a variant polypeptide of the invention. Prokaryotic hosts may include 
gram negative as well as gram positive bacteria such as, for example, E. coli, S. 
typhimurium, Serratia marcescens and Bacillus subtilis. A polynucleotide coding for a 
mutant form of variant polypeptides of the invention can be used to transform or transfect 
the host using any of the techniques commonly known to those of ordinary skill in the art. 
Methods for preparing fused, operably linked genes and expressing them in bacteria or 
animal cells are well-known in the art (Sambrook, supra). The genetic constructs and 
methods described therein can be utilized for expression of variant polypeptides of the 
invention in, e.g., prokaryotic hosts. In general, expression vectors containing promoter 
sequences which facilitate the efficient transcription of the inserted polynucleotide are 
used in connection with the host. The expression vector typically contains an origin of 
replication, a promoter, and a terminator, as well as specific genes which are capable of 
providing phenotypic selection of the transformed cells. The transformed prokaryotic hosts 
can be grown in fermentors and cultured according to techniques known in the art to 
achieve optimal cell growth. The proteins of the invention can then be isolated from the 
growth medium, cellular lysates, or cellular membrane fractions. The isolation and 
purification of the microbially or otherwise expressed polypeptides of the invention may be 
by any conventional means such as, for example, preparative chromatographic 
separations and immunological separations such as those involving the use of monoclonal 
or polyclonal antibodies. 

Thus, in a further embodiment the invention relates to a method for producing a molecular 
variant polypeptide or fragment thereof comprising culturing the above described host cell; 
and recovering said protein or fragment from the culture. 

In another embodiment the present invention relates to a method for producing cells 
capable of expressing a molecular variant polypeptide comprising genetically engineering 
cells with the polynucleotide of the invention, the gene of the invention or the vector of the 
invention. 

The cells obtainable by the method of the invention can be used, for example, to test drugs 
according to the methods described in D. L. Spector, R. D. Goldman, L. A. Leinwand, 
Cells, a Lab manual, CSH Press 1998. Furthermore, the cells can be used to study known 
drugs and unknown derivatives thereof for their ability to complement the deficiency 
caused by mutations in the CYP3A5 gene. For these embodiments the host cells 
preferably lack a wild type allele, preferably both alleles of the CYP3A5 gene and/or have 
at least one mutated form thereof. Ideally, the gene comprising an allele as comprised by 
the polynucleotides of the invention could be introduced into the wild type locus by 
homologous replacement. Alternatively, strong overexpression of a mutated allele over 
the normal allele and comparison with a recombinant cell line overexpressing the normal 
allele at a similar level may be used as a screening and analysis system. The cells 
obtainable by the above-described method may also be used for the screening methods 
referred to herein below. 

Furthermore, the invention relates to a polypeptide or fragment thereof encoded by the 
polynucleotide of the invention, the gene of the invention or obtainable by the method 
described above or from cells produced by the method described above. 

In this context it is also understood that the variant polypeptide of the invention can be 
further modified by conventional methods known in the art. By providing said variant 
proteins according to the present invention it is also possible to determine the portions 
relevant for their biological activity or inhibition of the same. The terms "polypeptide" and 
"protein" as used herein are exchangeable. Moreover, what is comprised by said terms is 
standard textbook knowledge. 

The present invention furthermore relates to an antibody which binds specifically to the 
polypeptide of the invention. 

Advantageously, the antibody specifically recognizes or binds an epitope containing one or 
more amino acid substitution(s) as defined above. Antibodies against the variant 
polypeptides of the invention can be prepared by well known methods using a purified 
protein according to the invention or a (synthetic) fragment derived therefrom as an 
antigen. Monoclonal antibodies can be prepared, for example, by the techniques as 
originally described in Kohler and Milstein, Nature 256 (1975), 495, and Galfre, Meth. 
Enzymol. 73 (1981), 3, which comprise the fusion of mouse myeloma cells to spleen cells 
derived from immunized mammals. In a preferred embodiment of the invention, said 
antibody is a monoclonal antibody, a polyclonal antibody, a single chain antibody, human 
or humanized antibody, primatized, chimerized or fragment thereof that specifically binds 
said peptide or polypeptide also including bispecific antibody, synthetic antibody, antibody 
fragment, such as Fab, Fv or scFv fragments etc., or a chemically modified derivative of 
any of these. Furthermore, antibodies or fragments thereof to the aforementioned 
polypeptides can be obtained by using methods which are described, e.g., in Harlow and 
Lane "Antibodies, A Laboratory Manual", CSH Press, Cold Spring Harbor, 1988. These 
antibodies can be used, for example, for the immunoprecipitation and immunolocalization 
of the variant polypeptides of the invention as well as for the monitoring of the presence of 
said variant polypeptides, for example, in recombinant organisms, and for the identification 
of compounds interacting with the proteins according to the invention. For example, 
surface plasmon resonance as employed in the BIAcore system can be used to increase 
the efficiency of phage antibodies which bind to an epitope of the protein of the invention 
(Schier, Human Antibodies Hybridomas 7 (1996), 97-105; Malmborg, J. Immunol. Methods 
183 (1995), 7-13). 

In a preferred embodiment the antibody of the present invention specifically recognizes an 
epitope containing one or more amino acid substitution(s) resulting from a nucleotide 
exchange as defined supra. 

Antibodies which specifically recognize modified amino acids such as phospho-Tyrosine 
residues are well known in the art. Similarly, in accordance with the present invention 
antibodies which specifically recognize even a single amino acid exchange in an epitope 
may be generated by the well known methods described supra. 

In light of the foregoing, in a more preferred embodiment the antibody of the present 
invention is monoclonal or polyclonal. 

The invention also relates to a transgenic non-human animal comprising at least one 
polynucleotide of the invention, the gene of the invention or the vector of the invention as 
described supra. 

The present invention also encompasses a method for the production of a transgenic non- 
human animal comprising introduction of a polynucleotide or vector of the invention into a 
germ cell, an embryonic cell, stem cell or an egg or a cell derived therefrom. The non- 
human animal can be used in accordance with the method of the invention described 
below and may be a non-transgenic healthy animal, or may have a disease or disorder, 
preferably a disease caused by at least one mutation in the gene of the invention. Such 
transgenic animals are well suited for, e.g., pharmacological studies of drugs in connection 
with variant forms of the above described variant polypeptides since these polypeptides or 
at least their functional domains are conserved between species in higher eukaryotes, 
particularly in mammals. Production of transgenic embryos and screening of those can be 
performed, e.g., as described by A. L. Joyner Ed., Gene Targeting, A Practical Approach 
(1993), Oxford University Press. The DNA of the embryos can be analyzed using, e.g., 
Southern blots with an appropriate probe or based on PCR techniques. 

A transgenic non-human animal in accordance with the invention may be a transgenic 
mouse, rat, hamster, dog, monkey, rabbit, pig, frog, nematode such as Caenorhabditis 
elegans, fruitfly such as Drosophila melanogaster or fish such as torpedo fish or zebrafish 
comprising a polynucleotide or vector of the invention or obtained by the method described 
above, preferably wherein said polynucleotide or vector is stably integrated into the 
genome of said non-human animal, preferably such that the presence of said 
polynucleotide or vector leads to the expression of the variant polypeptide of the invention. 
It may comprise one or several copies of the same or different polynucleotides or genes of 
the invention. This animal has numerous utilities, including as a research model for cancer 
or diseases including cardiovascular diseases, diabetes and AIDS or any other disease 
caused by a dysfunction or dysregulation of the polynucleotides or polypeptides of the 
invention, and therefore presents a novel and valuable animal in the 
development of therapies, treatment, etc. for cancer or diseases including 
cardiovascular diseases, diabetes and AIDS or any other disease caused by a 
dysfunction or dysregulation of the polynucleotides or polypeptides of the invention. 
Accordingly, in this instance, the mammal is preferably a laboratory animal such as a 
mouse or rat. 

Thus, in a preferred embodiment the transgenic non-human animal of the invention is a 
mouse, a rat or a zebrafish. 

Numerous reports revealed that said animals are particularly well suited as model 
organisms for the investigation of the drug metabolism and its deficiencies or cancer. 
Advantageously, transgenic animals can be easily created using said model organisms, 
due to the availability of various suitable techniques well known in the art. 

The invention also relates to a solid support comprising one or a plurality of the 
polynucleotide, the gene, the vector, the polypeptide, the antibody or the host cell of the 
invention in immobilized form. 

The term "solid support" as used herein refers to a flexible or non-flexible support that is 
suitable for carrying said immobilized targets. Said solid support may be homogenous or 
inhomogeneous. For example, said solid support may consist of different materials having 
the same or different properties with respect to flexibility and immobilization, for instance, 
or said solid support may consist of one material exhibiting a plurality of properties also 
comprising flexibility and immobilization properties. Said solid support may comprise glass-, 
polypropylene- or silicon-chips, membranes, oligonucleotide-conjugated beads or bead 
arrays. 

The term "immobilized" means that the molecular species of interest is fixed to a solid 
support, preferably covalently linked thereto. This covalent linkage can be achieved by 
different means depending on the molecular nature of the molecular species. Moreover, 
the molecular species may also be fixed on the solid support by electrostatic forces, 
hydrophobic or hydrophilic interactions or Van-der-Waals forces. The above described 
physico-chemical interactions typically occur in interactions between molecules. For 
example, biotinylated polypeptides may be fixed on an avidin-coated solid support due to 
interactions of the above described types. Further, polypeptides such as antibodies may 
be fixed on an antibody coated solid support. Moreover, the immobilization is dependent 
on the chemical properties of the solid support. For example, the nucleic acid molecules 
can be immobilized on a membrane by standard techniques such as UV-crosslinking or 
heat. 

In a preferred embodiment of the invention said solid support is a membrane, a glass-, 
polypropylene- or silicon-chip, oligonucleotide-conjugated beads or a bead array 
which is assembled on an optical filter substrate. 

Moreover, the present invention relates to an in vitro method for identifying a 
polymorphism, said method comprising the steps of: 

(a) isolating a polynucleotide or the gene of the invention from a plurality of subgroups 
of individuals, wherein one subgroup has no prevalence for a CYP3A5 associated 
disease and at least one or more further subgroup(s) do have prevalence for a 
CYP3A5 associated disease; and 

(b) identifying a polymorphism by comparing the nucleic acid sequence of said 
polynucleotide or said gene of said one subgroup having no prevalence for a 
CYP3A5 associated disease with said at least one or more further subgroup(s) 
having a prevalence for a CYP3A5 associated disease. 



WO 02/053775 PCT/EP01/15290 

The term "prevalence" as used herein means that individuals are susceptible for one or 
more disease(s) which are associated with CYP3A5 dysfunction or dysregulation or could 
already have one or more of said disease(s). Thereby, one CYP3A5 associated disease 
can be used to determine the susceptibility for another CYP3A5 associated disease, e.g. 
impaired drug metabolism may be indicative for a prevalence for, e.g. cancer. Moreover, 
symptoms which are indicative for a prevalence for developing said diseases are very well 
known in the art and have been sufficiently described in standard textbooks such as 
Pschyrembel. 

Advantageously, polymorphisms according to the present invention which are associated 
with CYP3A5 dysfunction or dysregulation or one or more disease(s) based thereon 
should be enriched in subgroups of individuals which have a prevalence for said diseases 
versus subgroups which have no prevalence for said diseases. Thus, the above described 
method allows the rapid and reliable detection of polymorphisms which are indicative for 
one or more CYP3A5 associated disease(s) or a susceptibility therefor. Advantageously, 
due to the phenotypic preselection a large number of individuals having no prevalence 
might be screened for polymorphisms in general. Thereby, reference sequences 
comprising polymorphisms which do not correlate to one or more CYP3A5 associated 
disease(s) can be obtained. Based on said reference sequences it is possible to efficiently 
and reliably determine the relevant polymorphisms. 
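
The comparison between subgroups described above can be sketched, purely for 
illustration, as follows. The sketch assumes aligned sequences of equal length, a simple 
per-position mismatch count against a reference sequence and an arbitrary enrichment 
threshold; the function names, the toy sequences and the threshold are hypothetical and 
do not form part of the method as claimed. 

```python
from collections import Counter


def variant_frequencies(sequences, reference):
    """Return, per position, the fraction of sequences differing from the reference."""
    counts = Counter()
    for sequence in sequences:
        for position, (base, reference_base) in enumerate(zip(sequence, reference)):
            if base != reference_base:
                counts[position] += 1
    return {position: count / len(sequences) for position, count in counts.items()}


def enriched_positions(with_prevalence, without_prevalence, reference, min_difference=0.3):
    """Positions whose variant frequency is higher in the subgroup with prevalence."""
    freq_with = variant_frequencies(with_prevalence, reference)
    freq_without = variant_frequencies(without_prevalence, reference)
    enriched = {}
    for position in sorted(set(freq_with) | set(freq_without)):
        difference = freq_with.get(position, 0.0) - freq_without.get(position, 0.0)
        if difference >= min_difference:
            enriched[position] = difference
    return enriched


# Toy data (hypothetical): position 2 varies only in the subgroup with prevalence,
# position 6 varies equally often in both subgroups.
reference = "ACGTACGT"
without_prevalence = ["ACGTACGT", "ACGTACGT", "ACGTACTT", "ACGTACGT"]
with_prevalence = ["ACATACGT", "ACATACTT", "ACATACGT", "ACGTACGT"]

hits = enriched_positions(with_prevalence, without_prevalence, reference)
print(hits)
```

In this toy data set the variation at position 6 occurs at the same frequency in both 
subgroups and is therefore not reported, which corresponds to a polymorphism that does 
not correlate with a CYP3A5 associated disease. 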

In a further embodiment the present invention relates to a method for identifying and 
obtaining a pro-drug or a drug capable of modulating the activity of a molecular variant of a 
CYP3A5 polypeptide comprising the steps of: 

(a) contacting the polypeptide, the solid support of the invention, a cell expressing a 
molecular variant gene comprising a polynucleotide of the invention, the gene or the 
vector of the invention in the presence of components capable of providing a 
detectable signal in response to drug activity with a compound to be screened for 
pro-drug or drug activity; and 

(b) detecting the presence or absence of a signal or increase or decrease of a signal 
generated from the pro-drug or the drug activity, wherein the absence, presence, 
increase or decrease of the signal is indicative for a putative pro-drug or drug. 
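
Step (b) above may be illustrated by a minimal sketch in which the signal measured in 
the presence of a compound is compared with the signal of a control without the 
compound. The fold-change criterion, the function name and the numerical values are 
assumptions made for illustration only; the signal actually used and a suitable threshold 
depend on the assay. 

```python
def classify_signal(control_signal: float, treated_signal: float,
                    min_fold_change: float = 2.0) -> str:
    """Classify a measured signal relative to a control without compound."""
    if control_signal <= 0 or treated_signal < 0:
        raise ValueError("signals must be non-negative and the control must be positive")
    ratio = treated_signal / control_signal
    if ratio >= min_fold_change:
        return "increase"
    if ratio <= 1.0 / min_fold_change:
        return "decrease"
    return "no change"


# Hypothetical read-outs in arbitrary units.
print(classify_signal(100.0, 250.0))
print(classify_signal(100.0, 40.0))
print(classify_signal(100.0, 110.0))
```

A compound giving an "increase" or a "decrease" would be regarded as a putative pro-drug 
or drug and examined further, whereas "no change" gives no indication of activity under 
the conditions tested. 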

The term "compound" in a method of the invention includes a single substance or a 
plurality of substances which may or may not be identical. 
Said compound(s) may be chemically synthesized or produced via microbial fermentation 
but can also be comprised in, for example, samples, e.g., cell extracts from, e.g., plants, 
animals or microorganisms. Furthermore, said compounds may be known in the art but 
hitherto not known to be useful as an inhibitor, respectively. The plurality of compounds 
may be, e.g., added to the culture medium or injected into a cell or non-human animal of 
the invention. 

If a sample containing (a) compound(s) is identified in the method of the invention, then it 
is either possible to isolate the compound from the original sample identified as containing 
the compound in question or one can further subdivide the original sample, for example, if 
it consists of a plurality of different compounds, so as to reduce the number of different 
substances per sample and repeat the method with the subdivisions of the original sample. 
It can then be determined whether said sample or compound displays the desired 
properties, for example, by the methods described herein or in the literature (Spector et al., 
Cells manual; see supra). Depending on the complexity of the samples, the steps 
described above can be performed several times, preferably until the sample identified 
according to the method of the invention only comprises a limited number of or only one 
substance(s). Preferably said sample comprises substances of similar chemical and/or 
physical properties, and most preferably said substances are identical. The methods of the 
present invention can be easily performed and designed by the person skilled in the art, for 
example in accordance with other cell based assays described in the prior art or by using 
and modifying the methods as described herein. Furthermore, the person skilled in the art 
will readily recognize which further compounds may be used in order to perform the 
methods of the invention, for example, enzymes, if necessary, that convert a certain 
compound into a precursor. Such adaptation of the method of the invention is well within 
the skill of the person skilled in the art and can be performed without undue 
experimentation. 
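
The repeated subdivision of a sample described above can be sketched as a simple 
recursive procedure. The sketch assumes that a sample is a list of substances, that it is 
split into halves, and that a subdivision shows the desired property whenever it contains 
at least one active substance; the names and the stand-in assay are hypothetical. 

```python
def deconvolute(sample, shows_desired_property):
    """Subdivide an active sample until only single active substances remain."""
    if not shows_desired_property(sample):
        return []  # inactive subdivisions are not pursued further
    if len(sample) == 1:
        return list(sample)
    middle = len(sample) // 2
    return (deconvolute(sample[:middle], shows_desired_property)
            + deconvolute(sample[middle:], shows_desired_property))


# Hypothetical sample and stand-in assay: one substance is assumed to be active.
active_substances = {"substance_C"}


def assay(subsample):
    return any(substance in active_substances for substance in subsample)


original_sample = ["substance_A", "substance_B", "substance_C", "substance_D", "substance_E"]
identified = deconvolute(original_sample, assay)
print(identified)
```

The sketch does not account for substances that are active only in combination or that 
mask one another; in such cases the subdivisions would have to be chosen differently. 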

Compounds which can be used in accordance with the present invention include peptides, 
proteins, nucleic acids, antibodies, small organic compounds, ligands, peptidomimetics, 
PNAs and the like. Said compounds may act as agonists or antagonists of the invention. 
Said compounds can also be functional derivatives or analogues of known drugs. Methods 
for the preparation of chemical derivatives and analogues are well known to those skilled 
in the art and are described in, for example, Beilstein, Handbook of Organic Chemistry, 
Springer edition New York Inc., 175 Fifth Avenue, New York, N.Y. 10010 U.S.A. and 
Organic Synthesis, Wiley, New York, USA. Furthermore, said derivatives and analogues 
can be tested for their effects according to methods known in the art or as described. 
Furthermore, peptide mimetics and/or computer aided design of appropriate drug 
derivatives and analogues can be used, for example, according to the methods described 
below. Such analogs comprise molecules that may have as their basis the structure of known 
CYP3A5 substrates and/or inhibitors and/or modulators; see infra. 

Appropriate computer programs can be used for the identification of interactive sites of a 
putative inhibitor and the polypeptides of the invention by computer-assisted searches for 
complementary structural motifs (Fassina, Immunomethods 5 (1994), 114-120). Further 
appropriate computer systems for the computer aided design of protein and peptides are 
described in the prior art, for example, in Berry, Biochem. Soc. Trans. 22 (1994), 1033- 
1036; Wodak, Ann. N. Y. Acad. Sci. 501 (1987), 1-13; Pabo, Biochemistry 25 (1986), 
5987-5991. The results obtained from the above-described computer analysis can be used 
in combination with the method of the invention for, e.g., optimizing known inhibitors, 
analogs, antagonists or agonists. Appropriate peptidomimetics and other inhibitors can 
also be identified by the synthesis of peptidomimetic combinatorial libraries through 
successive chemical modification and testing the resulting compounds, e.g., according to 
the methods described herein. Methods for the generation and use of peptidomimetic 
combinatorial libraries are described in the prior art, for example in Ostresh, Methods in 
Enzymology 267 (1996), 220-234 and Dorner, Bioorg. Med. Chem. 4 (1996), 709-715. 
Furthermore, the three-dimensional and/or crystallographic structure of said compounds 
and the polypeptides of the invention can be used for the design of peptidomimetic drugs 
(Rose, Biochemistry 35 (1996), 12933-12944; Rutenber, Bioorg. Med. Chem. 4 (1996), 
1545-1558). It is very well known how to obtain said compounds, e.g. by chemical or 
biochemical standard techniques. Thus, also comprised by the method of the invention are 
means of making or producing said compounds. In summary, the present invention 
provides methods for identifying and obtaining compounds which can be used in specific 
doses for the treatment of specific forms of CYP3A5 associated diseases, e.g. 
dysfunctions of the drug metabolism or cancer. 

The above definitions apply mutatis mutandis to all of the methods described in the 
following. 

In a further embodiment the present invention relates to a method for identifying and 
obtaining an inhibitor of the activity of a molecular variant of a CYP3A5 polypeptide 
comprising the steps of: 

(a) contacting the protein, the solid support of the invention or a cell expressing a 
molecular variant gene comprising a polynucleotide or the gene or the vector of the 
invention in the presence of components capable of providing a detectable signal in 
response to drug activity with a compound to be screened for inhibiting activity; and 

(b) detecting the presence or absence of a signal or increase or decrease of a signal 
generated from the inhibiting activity, wherein the absence or decrease of the signal 
is indicative for a putative inhibitor. 

In a preferred embodiment of the method of the invention said cell is a cell obtained by the 
method of the invention or can be obtained from the transgenic non-human animal as 
described supra. 

In a still further embodiment the present invention relates to a method of identifying and 
obtaining a pro-drug or drug capable of modulating the activity of a molecular variant of a 
CYP3A5 polypeptide comprising the steps of: 

(a) contacting the host cell, the cell obtained by the method of the invention, the 
polypeptide or the solid support of the invention with the first molecule known to be 
bound by a CYP3A5 polypeptide to form a first complex of said polypeptide and 
said first molecule; 

(b) contacting said first complex with a compound to be screened, and 

(c) measuring whether said compound displaces said first molecule from said first 
complex. 

Advantageously, in said method said measuring step comprises measuring the formation 
of a second complex of said protein and said inhibitor candidate. Preferably, said 
measuring step comprises measuring the amount of said first molecule that is not bound to 
said protein. 

In a particularly preferred embodiment of the above-described method said first 
molecule is an agonist or antagonist or a substrate and/or an inhibitor and/or a modulator of 
the polypeptide of the invention, e.g., with a radioactive or fluorescent label. 

In still another embodiment the present invention relates to a method of identifying and 
obtaining an inhibitor capable of modulating the activity of a molecular variant of a 
CYP3A5 polypeptide comprising the steps of: 

(a) contacting the host cell or the cell obtained by the method of the invention, the 
protein or the solid support of the invention with the first molecule known to be 
bound by a CYP3A5 polypeptide to form a first complex of said protein and said first 
molecule; 

(b) contacting said first complex with a compound to be screened, and 

(c) measuring whether said compound displaces said first molecule from said first 
complex. 

In a preferred embodiment of the method of the invention said measuring step comprises 
measuring the formation of a second complex of said protein and said compound. 

In another preferred embodiment of the method of the invention said measuring step 
comprises measuring the amount of said first molecule that is not bound to said protein. 

In a more preferred embodiment of the method of the invention said first molecule is 
labeled. 
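
Where the first molecule is labeled, the displacement measured in step (c) can be expressed as a simple percentage. The following is a minimal sketch only; it assumes that the bound label has been quantified once without and once with the compound to be screened, in the same arbitrary units and after background subtraction. The function name and parameters are illustrative and do not appear in the specification.

```python
def percent_displacement(bound_control: float,
                         bound_with_compound: float) -> float:
    """Percentage of labeled first molecule displaced from the complex.

    bound_control       -- bound label measured without the test compound
    bound_with_compound -- bound label measured with the test compound
    Both values are assumed to be background-corrected (assumption).
    """
    if bound_control <= 0:
        raise ValueError("control signal must be positive")
    displaced = bound_control - bound_with_compound
    return 100.0 * displaced / bound_control
```

A value close to 0 would indicate that the compound does not compete with the first molecule, whereas a value approaching 100 would indicate nearly complete displacement.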

The invention furthermore relates to a method for the production of a pharmaceutical 
composition comprising the steps of the method as described supra; and the further step 
of formulating the compound identified and obtained or a derivative thereof in a 
pharmaceutically acceptable form. 

The therapeutically useful compounds identified according to the methods of the invention 
can be formulated and administered to a patient as discussed above. For uses and 
therapeutic doses determined to be appropriate by one skilled in the art and for definitions 
of the term "pharmaceutical composition" see infra. 

Furthermore, the present invention encompasses a method for the preparation of a 
pharmaceutical composition comprising the steps of the above-described methods; and 
formulating a drug or pro-drug in the form suitable for therapeutic application and 
preventing or ameliorating the disorder of the subject diagnosed in the method of the 
invention. 

Drugs or pro-drugs after their in vivo administration are metabolized in order to be 
eliminated either by excretion or by metabolism to one or more active or inactive 
metabolites (Meyer, J. Pharmacokinet. Biopharm. 24 (1996), 449-459). Thus, rather than 
using the actual compound or inhibitor identified and obtained in accordance with the 
methods of the present invention a corresponding formulation as a pro-drug can be used 
which is converted into its active form in the patient. Precautionary measures that may be taken 
for the application of pro-drugs and drugs are described in the literature; see, for review, 
Ozama, J. Toxicol. Sci. 21 (1996), 323-329. 

In a preferred embodiment of the method of the present invention said drug or prodrug is a 
derivative of a medicament as defined hereinafter. 

The present invention also relates to a method of diagnosing a disorder related to the 
presence of a molecular variant of the CYP3A5 gene or susceptibility to such a disorder 
comprising determining the presence of a polynucleotide or the gene of the invention in a 
sample from a subject. 

In accordance with this embodiment of the present invention, the method of testing the 
status of a disorder or susceptibility to such a disorder can be effected by using a 
polynucleotide, gene or nucleic acid of the invention, e.g., in the form of a Southern or 
Northern blot or in situ analysis. Said nucleic acid sequence may hybridize to a coding 
region of either of the genes or to a non-coding region, e.g. intron. In the case that a 
complementary sequence is employed in the method of the invention, said nucleic acid 
molecule can again be used in Northern blots. Additionally, said testing can be done in 
conjunction with an actual blocking, e.g., of the transcription of the gene and thus is 
expected to have therapeutic relevance. Furthermore, a primer or oligonucleotide can also 
be used for hybridizing to one of the above mentioned CYP3A5 genes or corresponding 
mRNAs. The nucleic acids used for hybridization can, of course, be conveniently labeled 
by incorporating or attaching, e.g., a radioactive or other marker. Such markers are well 
known in the art. The labeling of said nucleic acid molecules can be effected by 
conventional methods. 

Additionally, the presence or expression of variant CYP3A5 gene can be monitored by 
using a primer pair that specifically hybridizes to either of the corresponding nucleic acid 
sequences and by carrying out a PCR reaction according to standard procedures. Specific 
hybridization of the above mentioned probes or primers preferably occurs at stringent 
hybridization conditions. The term "stringent hybridization conditions" is well known in the 
art; see, for example, Sambrook et al., "Molecular Cloning, A Laboratory Manual" second 
ed., CSH Press, Cold Spring Harbor, 1989; "Nucleic Acid Hybridisation, A Practical 
Approach", Hames and Higgins eds., IRL Press, Oxford, 1985. Furthermore, the mRNA, 
cRNA, cDNA or genomic DNA obtained from the subject may be sequenced to identify 
mutations which may be characteristic fingerprints of mutations in the polynucleotide or the 
gene of the invention. The present invention further comprises methods wherein such a 
fingerprint may be generated by RFLPs of DNA or RNA obtained from the subject, 
optionally the DNA or RNA may be amplified prior to analysis, the methods of which are 
well known in the art. RNA fingerprints may be performed by, for example, digesting an 
RNA sample obtained from the subject with a suitable RNA-Enzyme, for example RNase 
T1, RNase T2 or the like or a ribozyme and, for example, electrophoretically separating and 
detecting the RNA fragments as described above. 

Further modifications of the above-mentioned embodiment of the invention can be easily 
devised by the person skilled in the art, without any undue experimentation from this 
disclosure; see, e.g., the examples. An additional embodiment of the present invention 
relates to a method wherein said determination is effected by employing an antibody of the 
invention or fragment thereof. The antibody used in the method of the invention may be 
labeled with detectable tags such as a histidine tag or a biotin molecule. 

The invention relates to a method of diagnosing a disorder related to the presence of a 
molecular variant of a CYP3A5 gene or susceptibility to such a disorder comprising 
determining the presence of a polypeptide or the antibody of the invention in a sample 
from a subject. 

In a preferred embodiment of the above described method said disorder is cancer or 
diseases including cardiovascular diseases, diabetes and AIDS. 

In a preferred embodiment of the present invention, the above described method 
comprises PCR, ligase chain reaction, restriction digestion, direct sequencing, nucleic 
acid amplification techniques, hybridization techniques or immunoassays. 
Said techniques are very well known in the art. 

Moreover, the invention relates to a method of detection of the polynucleotide or the gene 
of the invention in a sample comprising the steps of 

(a) contacting the solid support described supra with the sample under conditions 
allowing interaction of the polynucleotide or the gene of the invention with the 
immobilized targets on a solid support; and 

(b) determining the binding of said polynucleotide or said gene to said immobilized 
targets on a solid support. 

The invention also relates to an in vitro method for diagnosing a disease comprising the 
steps of the method described supra, wherein binding of said polynucleotide or gene to 
said immobilized targets on said solid support is indicative for the presence or the absence 
of said disease or a prevalence for said disease. 

The invention furthermore relates to a diagnostic composition comprising the 
polynucleotide, the gene, the vector, the polypeptide or the antibody of the invention. 

In addition, the invention relates to a pharmaceutical composition comprising the 
polynucleotide, the gene, the vector, the polypeptide or the antibody of the invention. 

These pharmaceutical compositions comprising, e.g., the antibody may conveniently be 
administered by any of the routes conventionally used for drug administration, for instance, 
orally, topically, parenterally or by inhalation. Acceptable salts comprise acetate, 
methylester, HCl, sulfate, chloride and the like. The compounds may be administered in 
conventional dosage forms prepared by combining the drugs with standard pharmaceutical 
carriers according to conventional procedures. These procedures may involve mixing, 
granulating and compressing or dissolving the ingredients as appropriate to the desired 
preparation. It will be appreciated that the form and character of the pharmaceutically 
acceptable carrier or diluent is dictated by the amount of active ingredient with which it 
is to be combined, the route of administration and other well-known variables. The 
carrier(s) must be "acceptable" in the sense of being compatible with the other ingredients 
of the formulation and not deleterious to the recipient thereof. The pharmaceutical carrier 
employed may be, for example, either a solid or liquid. Exemplary of solid carriers are 
lactose, terra alba, sucrose, talc, gelatin, agar, pectin, acacia, magnesium stearate, stearic 
acid and the like. Exemplary of liquid carriers are phosphate buffered saline solution, 
syrup, oil such as peanut oil and olive oil, water, emulsions, various types of wetting 
agents, sterile solutions and the like. Similarly, the carrier or diluent may include time delay 
material well known to the art, such as glyceryl mono-stearate or glyceryl distearate alone 
or with a wax. 

The dosage regimen will be determined by the attending physician and other clinical 
factors; preferably in accordance with any one of the above described methods. As is well 
known in the medical arts, dosages for any one patient depend upon many factors, 
including the patient's size, body surface area, age, the particular compound to be 
administered, sex, time and route of administration, general health, and other drugs being 
administered concurrently. Progress can be monitored by periodic assessment. 
Furthermore, the use of pharmaceutical compositions which comprise antisense- 
oligonucleotides which specifically hybridize to RNA encoding mutated versions of the 
polynucleotide or gene according to the invention or which comprise antibodies 
specifically recognizing a mutated polypeptide of the invention but not or not substantially 
the functional wild-type form is conceivable in cases in which the concentration of the 
mutated form in the cells should be reduced. 

Thanks to the present invention the particular drug selection, dosage regimen and 
corresponding patients to be treated can be determined in accordance with the present 
invention. The dosing recommendations will be indicated in product labeling by allowing 
the prescriber to anticipate dose adjustments depending on the considered patient group, 
with information that avoids prescribing the wrong drug to the wrong patients at the wrong 
dose. 

In another embodiment the present invention relates to the use of the polynucleotide, a 
polynucleotide comprising SEQ ID No: 104, a polynucleotide encoding a polypeptide 
comprising SEQ ID No: 145, the gene, the vector or the polypeptide of the invention, a 
polypeptide comprising SEQ ID No: 145 or the antibody of the invention for the preparation 
of a diagnostic composition for diagnosing a disease. 

A gene encoding a functional and expressible polypeptide of the invention can be 
introduced into the cells which in turn produce the protein of interest. Gene therapy, which 
is based on introducing therapeutic genes into cells by ex-vivo or in-vivo techniques is one 
of the most important applications of gene transfer. Suitable vectors and methods for in- 
vitro or in-vivo gene therapy are described in the literature and are known to the person 
skilled in the art; see, e.g., Giordano, Nature Medicine 2 (1996), 534-539; Schaper, Circ. 
Res. 79 (1996), 911-919; Anderson, Science 256 (1992), 808-813; Isner, Lancet 348 
(1996), 370-374; Muhlhauser, Circ. Res. 77 (1995), 1077-1086; Wang, Nature Medicine 2 
(1996), 714-716; WO 94/29469; WO 97/00957 or Schaper, Current Opinion in 
Biotechnology 7 (1996), 635-640, and references cited therein. The gene may be designed 
for direct introduction or for introduction via liposomes, or viral vectors (e.g. adenoviral, 
retroviral) into the cell. Preferably, said cell is a germ line cell, embryonic cell, or egg cell or 
derived therefrom, most preferably said cell is a stem cell. As is evident from the above, it 
is preferred that in the use of the invention the nucleic acid sequence is operatively linked 
to regulatory elements allowing for the expression and/or targeting of the polypeptides of 
the invention to specific cells. Suitable gene delivery systems that can be employed in 
accordance with the invention may include liposomes, receptor-mediated delivery 
systems, naked DNA, and viral vectors such as herpes viruses, retroviruses, 
adenoviruses, and adeno-associated viruses, among others. Delivery of nucleic acids to a 
specific site in the body for gene therapy may also be accomplished using a biolistic 
delivery system, such as that described by Williams (Proc. Natl. Acad. Sci. USA 88 (1991), 
2726-2729). Standard methods for transfecting cells with recombinant DNA are well known 
to those skilled in the art of molecular biology, see, e.g., WO 94/29469; see also supra. 
Gene therapy may be carried out by directly administering the recombinant DNA molecule 
or vector of the invention to a patient or by transfecting cells with the polynucleotide or 
vector of the invention ex vivo and infusing the transfected cells into the patient. 
A polynucleotide comprising SEQ ID No: 104 and a polypeptide comprising SEQ ID No: 
145 have already been described in Jounaidi et al. (Jounaidi, Biochem Biophys Res 
Commun 221 (1996), 466-70). However, Jounaidi et al. have merely disclosed the 
respective amino acid and nucleotide sequences without making any suggestion towards 
the pharmaceutical and diagnostic value of said polynucleotide or polypeptide, in particular 
for those disorders and diseases referred to infra. 

In a further embodiment the present invention relates to the use of the polynucleotide, a 
polynucleotide comprising SEQ ID No: 104, a polynucleotide encoding a polypeptide 
comprising SEQ ID No: 145, the gene, the vector, the polypeptide of the invention, a 
polypeptide comprising SEQ ID No.: 145 or the antibody of the invention for the 
preparation of a pharmaceutical composition for treating a disease. 

In another embodiment the present invention encompasses the use of a polynucleotide 
selected from the group consisting of: 

(a) a polynucleotide having the nucleic acid sequence of SEQ ID NO: 082, 088, 
104, 112, 126, 131, or 140; 

(b) a polynucleotide encoding a polypeptide having the amino acid sequence of 
SEQ ID No: 127, 132, 141 or 145; 

(c) a polynucleotide capable of hybridizing to a CYP3A5 gene, wherein said 
polynucleotide is having at least one additional nucleotide at a position 
corresponding to position 3709/3710 or 27131/27132 of the CYP3A5 gene 
(Accession No: AF280107.1, wherein position 166220 has been numbered 
+1 and position 174832 has been numbered +8613, and Accession No: 
AC005020.2, wherein position 27341 has been numbered +8614) or a 
nucleotide exchange at a position corresponding to position 7303 or 27289 of 
the CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 has 
been numbered +1 and position 174832 has been numbered +8613, and 
Accession No: AC005020.2, wherein position 27341 has been numbered 
+8614); 

(d) a polynucleotide capable of hybridizing to a CYP3A5 gene, wherein said 
polynucleotide is having an additional G nucleotide at a position 
corresponding to position 3709/3710 of the CYP3A5 gene (Accession No: 
AF280107.1, wherein position 166220 has been numbered +1 and position 
174832 has been numbered +8613, and Accession No: AC005020.2, 
wherein position 27341 has been numbered +8614), an additional T 
nucleotide at a position corresponding to position 27131/27132 of the 
CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 has 
been numbered +1 and position 174832 has been numbered +8613, and 
Accession No: AC005020.2, wherein position 27341 has been numbered 
+8614), or an A at a position corresponding to position 7303 or 27289 of the 
CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 has 
been numbered +1 and position 174832 has been numbered +8613, and 
Accession No: AC005020.2, wherein position 27341 has been numbered 
+8614); 

for the preparation of a diagnostic composition for diagnosing a disease in a subject 
having a genome comprising a variant allele of the CYP3A5 gene, wherein said 
allele is having an A at a position corresponding to position 6986 of the CYP3A5 
gene (Accession No: AF280107.1, wherein position 166220 has been numbered +1 
and position 174832 has been numbered +8613, and Accession No: AC005020.2, 
wherein position 27341 has been numbered +8614). 
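
The position numbering used throughout the above definitions can be converted into coordinates of the two database entries as sketched below. The anchor points are those stated in the text (AF280107.1 position 166220 = +1, position 174832 = +8613; AC005020.2 position 27341 = +8614). That the AC005020.2 numbering continues in ascending order beyond its single stated anchor point is an assumption, and the function name is illustrative only.

```python
from typing import Tuple

AF_ACCESSION = "AF280107.1"
AC_ACCESSION = "AC005020.2"
AF_FIRST = 166220      # AF280107.1 position numbered +1
AF_LAST_GENE_POS = 8613  # gene position of AF280107.1 position 174832
AC_FIRST = 27341       # AC005020.2 position numbered +8614


def gene_to_accession(pos: int) -> Tuple[str, int]:
    """Map a CYP3A5 gene position (+1, +2, ...) to (accession, coordinate).

    Assumes ascending numbering in both entries (assumption for
    AC005020.2, for which the text gives only one anchor point).
    """
    if pos < 1:
        raise ValueError("gene positions start at +1")
    if pos <= AF_LAST_GENE_POS:
        return AF_ACCESSION, AF_FIRST + (pos - 1)
    return AC_ACCESSION, AC_FIRST + (pos - (AF_LAST_GENE_POS + 1))
```

Under these assumptions, position 6986 falls within AF280107.1, whereas positions 14690, 27131/27132 and 27289 fall within AC005020.2.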

The definitions of the terms referred to in this specification apply mutatis mutandis to the 
aforementioned use. 

The term "subject" inter alia refers to animals. Preferably, said animals belong to the 
animal species referred to above. Moreover, the term "subject" encompasses humans. 
The humans in accordance with the use of the present invention are selected from all 
existing ethnical groups and subgroups, e.g. Caucasians, African Americans or Asians. 
However, particularly well suited for the use of the invention are African Americans for which 
it could be demonstrated that diagnosing a disease or a prevalence for a disease based on 
monitoring the presence or absence of a CYP3A5 allele having at a position 
corresponding to position 6986 of the CYP3A5 gene (Accession No: AF280107.1, wherein 
position 166220 has been numbered +1 and position 174832 has been numbered +8613, 
and Accession No: AC005020.2, wherein position 27341 has been numbered +8614) an A 
results in a false positive prediction for CYP3A5 expression in a considerable number of 
subjects. This allele of CYP3A5 has been described in detail in Kuehl, 2001, Nature 
Genetics 27: 383-391 as CYP3A5*1 allele. The CYP3A5*1 allele is characterized by the 
presence of an A at position 22893 of the CYP3A5 nucleic acid sequence referred to in 
Kuehl, loc.cit. The allelic frequency of said allele is particularly high in African Americans 
although it is also present in other ethnical groups or subgroups. However, it was found in 
accordance with the present invention that the CYP3A5 expression of subjects for which a 
false positive result was obtained in diagnostic studies based on the CYP3A5*1 allele 
could be correctly predicted by further diagnosing the presence or absence of a 
polynucleotide as defined under (a) to (d) in accordance with the use of the present 
invention. For example, a polynucleotide having an additional nucleotide at a position 
corresponding to position 27131/27132 of the CYP3A5 gene as defined supra has been 
found in accordance with this invention to be present in approximately 10% of the African 
Americans resulting in a frameshift mutation in exon 11. The present invention provides 
means and methods to distinguish between the haplotype resulting in improved expression 
of CYP3A5 comprising the polymorphism(s) of the CYP3A5*1 allele and the haplotype 
resulting in decreased expression, wherein said haplotype as set forth above comprises in 
addition to the polymorphism(s) of the CYP3A5*1 allele co-segregating polymorphisms 
comprised by a polynucleotide referred to under (a) to (d) supra. Thus, based on the 
aforementioned use of the present invention a reliable diagnosis of the CYP3A5 activity of 
a subject is achieved. 

The invention also relates to the use of a polynucleotide comprising a polynucleotide 
having an A at a position corresponding to position 14690 of the CYP3A5 gene (Accession 
No: AF280107.1, wherein position 166220 has been numbered +1 and position 174832 
has been numbered +8613, and Accession No: AC005020.2, wherein position 27341 has 
been numbered +8614) for the preparation of a diagnostic composition for diagnosing a 
disease in a subject having a genome comprising a variant allele of the CYP3A5 gene, 
wherein said allele is having an A at a position corresponding to position 6986 of the 
CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 has been numbered 
+1 and position 174832 has been numbered +8613, and Accession No: AC005020.2, 
wherein position 27341 has been numbered +8614). 

The definitions of the terms referred to in this specification apply mutatis mutandis to the 
aforementioned use. 

In accordance with the present invention it could also be shown that the polymorphism(s) 
constituting the CYP3A5*1 and the CYP3A5*6 allele as described in Kuehl, loc.cit., co- 
segregate in a considerable number of subjects and thereby constitute another haplotype 
resulting in decreased CYP3A5 expression. It has been shown that the CYP3A5*6 allele 
results in inappropriate splicing of exon 7 of CYP3A5, a frameshift and a premature 
termination at position 184 of the CYP3A5 protein. Consequently, a false positive result as 
regards the expression level of CYP3A5 in a subject can be obtained in diagnostic studies 
based on the CYP3A5*1 allele. Said false positive results, however, can be avoided 
according to the use of this invention by further diagnosing the presence or absence of the 
polymorphism(s) of the CYP3A5*6 allele. Thus, based on the aforementioned use of the 
present invention a reliable diagnosis of the CYP3A5 activity of a subject is achieved. 
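
The diagnostic reasoning set out above can be summarised in the following minimal sketch. It assumes that the two haplotypes of a subject have been resolved (phase known) and that each of the co-segregating polymorphisms named in the text abolishes or reduces expression from the haplotype that carries it. The variant labels, the constant names and the function names are hypothetical shorthand and are not nomenclature used in the specification.

```python
from typing import Iterable

# A at position 6986: polymorphism of the CYP3A5*1 allele (per the text)
EXPRESSOR_MARKER = "6986A"

# Co-segregating polymorphisms named in the text (labels are illustrative)
DISRUPTING_VARIANTS = frozenset({
    "3709_3710insG",    # additional G at 3709/3710
    "27131_27132insT",  # additional T at 27131/27132 (frameshift, exon 11)
    "7303A",            # nucleotide exchange at 7303
    "27289A",           # nucleotide exchange at 27289
    "14690A",           # polymorphism of the CYP3A5*6 allele
})


def haplotype_expresses(variants: Iterable[str]) -> bool:
    """True if the haplotype carries the CYP3A5*1 marker and none of the
    co-segregating variants listed above (assumed to reduce expression)."""
    found = set(variants)
    return EXPRESSOR_MARKER in found and not (found & DISRUPTING_VARIANTS)


def predict_expressor(hap1: Iterable[str], hap2: Iterable[str]) -> bool:
    """Predict CYP3A5 expression from two phased haplotypes (assumption:
    one intact expressing haplotype suffices)."""
    return haplotype_expresses(hap1) or haplotype_expresses(hap2)
```

A subject carrying 6986A together with 14690A on the same haplotype, and no marker on the other, would be classified as an expressor by a test based on position 6986 alone, but as a non-expressor by this sketch.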

In light of the foregoing, in a preferred embodiment of the aforementioned use said 
subject is an African American. 

As has been discussed above, the number of subjects which are diagnosed false positive 
is due to the high allelic frequency of CYP3A5 alleles such as those comprising a 
polynucleotide as defined under (a) to (d) above resulting in a frame shift mutation or those 
comprised by the CYP3A5*6 allele. Said allelic frequency is particularly high within the 
group of African Americans. In accordance with the present invention it has been found 
that the CYP3A5*6 allele is present in about 13.3% of the African Americans. Thus within 
this ethnic group the problems emerging from a wrong prediction of CYP3A5 expression 
are more severe than for other ethnical groups. 

In a more preferred embodiment of the use of the present invention said disease is cancer 
or diseases including cardiovascular diseases, diabetes and AIDS. 



Finally, the present invention relates to a diagnostic kit for detection of a polymorphism 
comprising the polynucleotide, the gene, the vector, the polypeptide, the antibody, the host 
cell, the transgenic non-human animal or the solid support of the invention. 

The kit of the invention may contain further ingredients such as selection markers and 
components for selective media suitable for the generation of transgenic cells and animals. 
The kit of the invention can be used for carrying out a method of the invention and could 
be, inter alia, employed in a variety of applications, e.g., in the diagnostic field or as 
research tool. The parts of the kit of the invention can be packaged individually in vials or 
in combination in suitable containers or multicontainer units. Manufacture of the kit follows 
preferably standard procedures which are known to the person skilled in the art. The kit 
may be used for methods for detecting expression of a mutant form of the polypeptides, 
genes or polynucleotides in accordance with any one of the above-described methods of 
the invention, employing, for example, immunoassay techniques such as 
radioimmunoassay or enzymeimmunoassay or preferably nucleic acid hybridization and/or 
amplification techniques such as those described herein before and in the Examples as 
well as pharmacokinetic studies when using non-human transgenic animals of the 
invention. 

The figures illustrate the invention. 

Figure 1: A. Western blot analysis of CYP3A5 protein expression in microsomes prepared 
from 6 LE (low expressing) and 6 HE (high expressing) Caucasian livers. B. The relative 
contributions of CYP3A5 and CYP3A4 to the combined CYP3A5/CYP3A4 protein pool in 
17 HE livers as determined by Western blot. 

Figure 2: Expression levels and the allelic source of CYP3A5 transcripts in LE and HE 
Caucasian livers. A. TaqMan analysis of CYP3A5 mRNA in 8 LE (low expressing, white 
bars) and 8 HE (high expressing, grey bars) liver samples. B. Sequences of a portion of 
the 3'-UTR of CYP3A5 in samples heterozygous for variant ch-v-015 (Table 2A). 
Templates used for PCR were genomic DNA (left panel), cDNA from a LE liver (middle 
panel) and cDNA from an HE liver (right panel). 

Figure 3: Allelic frequencies of CYP3A5 genetic variants in Caucasians. A. In DNA 
samples derived from 8-168 LE individuals. B. In DNA samples derived from 7-18 HE 
individuals. The bars at the bottom of the figure indicate schematically the localisation of 
the pseudoexons PS2 exon 1 and 2 and exons 1 - 13 of the CYP3A5 gene. The 
arrowhead marks the duplication boundary (Gellner, Pharmacogenetics 11 (2001), 111 - 
121). 

Figure 4: Genomic and peptide sequences: genomic DNA sequences containing the 
amplified regions in which polymorphisms were detected and polypeptide sequences with 
amino acid substitutions. Nucleotide sequences are listed in 5' - 3' orientation. Letters in 
lowercase indicate non-coding sequences, letters in uppercase indicate coding 
sequences. Primer regions are underlined. Variant sites are shown framed. Peptide 
sequences are shown in one letter code. || marks a site where a deletion has occurred. In 
Seq ID 198, the hybridizing site of the TaqMan® probe is shown in bold. 

Figure 5: CYP3A5 cDNA insert region of the plasmid that was used as starting material for 
in vitro mutagenesis. Cloning sites are shown underlined. Modified 5' and 3' regions of the 
CYP3A5 cDNA are shown in lowercase letters. The 5' modification, a MALLLAVF amino 
acid sequence on protein level, has been introduced in order to increase expression in E. 
coli (Gillam, Arch Biochem Biophys 317 (1995), 374-84). The 3' modification, a His6 tag on 
protein level, has been introduced in order to enhance subsequent purification. The 
unmodified part of the CYP3A5 insert was verified to be identical to the CYP3A5 cDNA 
corresponding to accession no. NM_000777.1 by sequencing, which is the underlying 
nucleotide sequence for NP_000768.1. Sites corresponding to ch-v-009 and ch-v-001 are 
shown framed. 



The invention will now be described by reference to the following biological Examples 
which are merely illustrative and are not to be construed as a limitation of the scope of the 
present invention. 

Example 1: Isolation of genomic DNA from human blood, generation and 
purification of CYP3A5 gene fragments 

Genomic DNA was isolated from blood or liver samples using Qiagen blood and tissue 
DNA isolation kits. Oligonucleotides used in the screen were designed based on the 
recently determined sequence and organisation of the human CYP3A locus (Gellner, 
Pharmacogenetics 11 (2001), 111 - 121). Primer sequences and PCR fragment lengths 
are given in Table 1A. Amplified fragments were processed through PCR purification 
columns (Qiagen) and sequenced on PE ABI 3700 DNA Analysers using the same primers 
as in PCR. The sequences were analysed for the presence of polymorphisms using the 
PHRED/PHRAP/POLYPHRED/CONSED software package (University of Washington, 
Seattle, WA, USA). 

Total RNA was isolated from liver samples using the RNeasy kit (Qiagen) according to the 
manufacturer's instructions except that an additional DNase I digestion was performed 
directly on the column. cDNA pools were generated from 1 µg of total RNA using random 
hexamer primers and Superscript reverse transcriptase (Life Technologies). The cDNA 
used for one TaqMan assay was derived from 40 ng total RNA. CYP3A5 mRNA 
expression levels were quantified by real time quantitative PCR using the ABI 7700 
Sequence Detection System (PE Biosystems). Oligonucleotides and probes were 
designed with the Primer Express (PE Biosystems) programme. Oligonucleotides used for 
the quantitative PCR were: forward 5'- TTG TTG GGA AAT GTT TTG TCC TAT C -3' (Seq 
ID: 237) and reverse 5'- ACA GGG AGT TGA CCT TCA TAG GTT -3' (Seq ID: 238). The 
TaqMan probe (5'- TGA GGG TGT GTG GAA ATT TGA GAG AGA GTG GTA-3'; Seq ID: 
239) was labelled with the 5' reporter dye 6-carboxy-fluorescein (FAM) and the 3' 
quencher 6-carboxy-tetramethylrhodamine (TAMRA). The experiments were performed 
according to a standard protocol developed by PE Biosystems. The specificity of the assay 
for CYP3A5 was determined using equal amounts of CYP3A4, CYP3A5, CYP3A7 and 
CYP3A43 cDNA species expressed in vitro. The specificity of the probe was 10"^ times 
higher for CYP3A5 than for CYP3A7 cDNA whereas CYP3A4 and CYP3A43 cDNAs were 
not detectable at all. The linear range of the CYP3A5 assay was determined to be 
between 10 and 10^ target molecules. CYP3A5 expression levels were normalised using 
the expression of 18S mRNA species determined with pre-developed TaqMan assays (PE 
Biosystems). 



Example 2: Determination of genetic variations within the CYP3A5 locus 

Sequence diversity within the CYP3A5 locus was determined by PCR amplification from 
genomic DNA (fragment size: 264 - 997 bp) and sequencing of each PCR product from 19 - 
217 samples of Caucasian origin, 36 - 45 samples of African American origin, 34 - 47 
samples of Chinese origin, 41 - 50 samples of Japanese origin, and 31 - 47 samples of 
Korean origin. The PCR fragments encompass the entire protein-coding region of 
CYP3A5, a portion of the 3'-UTR, the entire 5'-UTR as well as 6203 bp of sequence between 
the CYP3A5 transcriptional start site and a Li_5'UTR_ORF repeat located upstream of the 
gene (Fig. 3). In addition, we genotyped two linked single nucleotide polymorphisms 
(SNPs, ch-v-020, ch-v-021, Table 2A-E) located in a sequence originally described as 
CYP3A5 promoter that were recently reported to co-segregate with increased CYP3A5 
protein expression (Paulussen, Pharmacogenetics 10 (2000), 415-24). Our results also 
indicate co-segregation of both variants. Using the recently determined sequence of the 
entire CYP3A locus (Gellner, Pharmacogenetics 11 (2001), 111 - 121), we place these 
variants approximately 20 kb upstream of the first exon of CYP3A5, in a sequence 5' 
adjacent to a CYP3A pseudogene (PS2 in Fig. 3). Furthermore, we genotyped 
a single nucleotide polymorphism located in intron 3 of the CYP3A5 gene (ch-v-048; 
Kuehl, 2001, Nature Genetics 27: 383-391) by TaqMan® assay using the primers and 
probes listed in Table 1B. The analyses were performed on a Sequence Detection System 
(PE Biosystems). 

A total of 29 variants including the two linked SNPs described by Paulussen et al. 
(Paulussen, Pharmacogenetics 10 (2000), 415-24) were detected in the screen of 
Caucasian samples and their allelic frequencies were estimated to be between 0.3 % and 
11.9 % (Table 2A). 6 variants are located within the 6 kb sequence upstream of the 
transcriptional start site of CYP3A5. 14 variants are located in introns, or in the 5'-UTR or 
3'-UTR, whereas 4 have been found in the protein-coding sequence. Among the latter 
ones, three variants result in amino acid substitutions and one in a premature termination 
of the CYP3A5 protein (Table 2A). The g.7303C>A variant (ch-v-009, Table 2A) results in 
a S100Y amino acid exchange in exon 4. The g.3705C>T variant (ch-v-005) leads to a 
H30Y amino acid exchange in exon 2. Cloning and sequencing revealed a physical linkage 
of this variant to the g.3709-3710insG variant (ch-v-006). The latter variant results in a shift 
of the open reading frame leading to a truncation of the protein sequence at position 34 
(K34.). The T398N variant (ch-v-001, Table 2A), originally described by Jounaidi (Jounaidi, 
Biochem Biophys Res Commun 221 (1996), 466-70), was found in 3 out of 80 individuals 
tested. 

None of the four protein-altering variants found in Caucasians has been found in the 
African-American, Chinese, Japanese or Korean samples (Table 2A-E). However, among 
others we have found 4 new variants in these samples that result in an altered CYP3A5 
amino acid sequence (ch-v-017, ch-v-043, ch-v-045, ch-v-068, Table 2B-E). The 
g.27131-27132insT variant in exon 11 (ch-v-017, Table 2B, 2D) has been found in 9 out 
of 45 African-American samples and in one out of 50 Japanese samples. The variant 
results in a shift of the open reading frame which leads to a truncation of the protein 
sequence at position 348 (D348.). Variants ch-v-043, ch-v-045 and ch-v-068 lead to amino 
acid exchanges. 

Example 3: Identification of genetic determinants of CYP3A5 protein expression 

In the following, the frequencies of Caucasian CYP3A5 gene variants have been analyzed 
as a function of CYP3A5 protein expression. For this purpose, allelic frequencies of 
variants shown in Table 2A were calculated separately for HE and LE livers (Fig. 3). The 
frequencies of 9 variants (ch-v-020, ch-v-021, ch-v-026, ch-v-034, ch-v-007, ch-v-008, 
ch-v-011, ch-v-014 and ch-v-015) were significantly increased in HE livers (all χ² > 13.3, df = 
1, p < 0.01, Bonferroni corrected). Except one, all tested HE livers (17/18, 94 %) were 
heterozygous for three variants (ch-v-021, ch-v-026 and ch-v-015). 16 of those samples 
were heterozygous for ch-v-020 as well. One HE sample could not be genotyped for this 
variant. In contrast, LE livers were either wildtype (155/168, 92.3 %), heterozygous for 
variants ch-v-021 and ch-v-026 (9/168, 5.4 %) or heterozygous for the variant ch-v-015 
(4/168, 2.4 %) only. In LE livers, however, the three variants never occurred simultaneously 
(Table 3). These results define each of the three variants as a useful but imperfect 
marker of increased CYP3A5 expression. The variants ch-v-034, ch-v-008, ch-v-011 and 
ch-v-014 only occurred in a subset of the samples heterozygous for the above three 
variants (ch-v-021, ch-v-026 and ch-v-015). 

The distribution of variants ch-v-021, ch-v-026 and ch-v-015 in the samples screened 
strongly suggests that they constitute a haplotype. In the following, the hypothesis that 
these three variants recombine independently has been tested. Assuming their 
independent inheritance, the expected 3-loci-genotype frequencies for all combinations of 
variants have been calculated and compared with the observed frequencies. The 
difference is highly significant (χ² = 93.6; classes 'all wildtype', 'single variant hetero- or 
homozygous', 'two or three variants hetero- or homozygous'; df = 1; p « 0.001). There 
were more individuals with two or three of the variants than expected and fewer individuals 
with only one of the variants. This result suggests linkage among the three variants. The 
degree of linkage was estimated with the linkage disequilibrium parameter D for the three 
pairs of variants. Using maximum likelihood estimates for haplotype frequencies, D was 
calculated to be 0.041 for the variant pairs ch-v-021/ch-v-015 and ch-v-026/ch-v-015, 
which is 80 % of its theoretical maximum, and 0.065 for variants ch-v-021 and ch-v-026 
which corresponds to 100 % of its theoretical maximum. 
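
The pairwise disequilibrium values quoted above follow the usual definition D = p(AB) - p(A) x p(B), scaled by the largest value D can take for the given allele frequencies. The following Python sketch is illustrative only. It uses the liver counts of Table 3 and assumes, for simplicity, that every triple heterozygote carries the three variants in cis; this is a simplification of the maximum likelihood haplotype estimate used in the text, so the results only approximate the reported figures. All function and variable names are hypothetical.

```python
def linkage_disequilibrium(p_ab, p_a, p_b):
    """Return (D, D_max, |D| / D_max) for two biallelic loci.

    p_ab -- frequency of the haplotype carrying both variant alleles
    p_a  -- variant allele frequency at the first locus
    p_b  -- variant allele frequency at the second locus
    """
    d = p_ab - p_a * p_b
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    ratio = abs(d) / d_max if d_max > 0 else 0.0
    return d, d_max, ratio


CHROMOSOMES = 2 * 186             # 186 livers listed in Table 3
p_021 = (9 + 17) / CHROMOSOMES    # ch-v-021 carriers (identical for ch-v-026)
p_015 = (4 + 17) / CHROMOSOMES    # ch-v-015 carriers
p_cis = 17 / CHROMOSOMES          # assumption: all 17 triple heterozygotes in cis

# ch-v-021 and ch-v-026 always occur together in Table 3.
d_021_026, _, frac_021_026 = linkage_disequilibrium(p_021, p_021, p_021)
# Pairs involving ch-v-015.
d_021_015, _, frac_021_015 = linkage_disequilibrium(p_cis, p_021, p_015)

print(f"D(021/026) = {d_021_026:.3f}, {frac_021_026:.0%} of maximum")
print(f"D(021/015) = {d_021_015:.3f}, {frac_021_015:.0%} of maximum")
```

Under these assumptions the sketch gives D of about 0.065 (100 % of the maximum) for ch-v-021/ch-v-026 and D of about 0.042 (about 80 % of the maximum) for the pairs involving ch-v-015, close to the values stated above.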

The probability that individuals showing the respective variant genotype are HE (positive 
predictive value) is estimated to be 65 % for variants ch-v-021 and ch-v-026, respectively, 
and 81 % for the ch-v-015 variant. For the combination of all three variants the positive 
predictive value is 100 % in our sample set. However, assuming that these variants need 
to be located in cis for increased protein expression, it is clear that there is some 
probability for individuals showing all three variants to be LE. The results show that at least 
the allele ch-v-021/ch-v-026 and the allele ch-v-015 actually exist (see genotype 2 and 3, 
Table 3) and therefore the existence of a genotype with a combination of these two alleles 
has to be postulated. The maximum likelihood estimate for the frequency of these 3-fold 
heterozygotes not having all three variants in cis is 0.05 % of all samples screened or 0.61 
% of samples hetero- or homozygous for all three variants. In other words, of 100 
Caucasians screened statistically about 9 of them will be hetero- or homozygous for all 
three variants and about 0.05 of these will not have all three variants in cis. Therefore, 
the positive predictive value of the 3-variant genotype can be expected to be about 
99.95 %. Of course, the same values would be achieved for a combination of only two 
variants, either ch-v-021/ch-v-015 or ch-v-026/ch-v-015. 
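
The positive predictive values observed in the sample set can be recomputed directly from the liver counts in Table 3. The following Python sketch does so; the data layout and the helper name are hypothetical, and ch-v-021 and ch-v-026 are treated as a single flag because they always occur together in Table 3.

```python
# Liver counts per genotype from Table 3:
# (carries ch-v-021/ch-v-026, carries ch-v-015, phenotype, number of livers)
TABLE3 = [
    (False, False, "LE", 155),  # genotype 1
    (True,  False, "LE", 9),    # genotype 2
    (False, True,  "LE", 4),    # genotype 3
    (True,  True,  "HE", 17),   # genotype 4
    (False, False, "HE", 1),    # genotype 5
]


def positive_predictive_value(is_carrier):
    """Fraction of carrier livers that are HE.

    is_carrier -- predicate taking (has_021_026, has_015) and returning bool
    """
    carriers = [(ph, n) for a, b, ph, n in TABLE3 if is_carrier(a, b)]
    he = sum(n for ph, n in carriers if ph == "HE")
    total = sum(n for _, n in carriers)
    return he / total


ppv_021 = positive_predictive_value(lambda a, b: a)        # 17 of 26
ppv_015 = positive_predictive_value(lambda a, b: b)        # 17 of 21
ppv_all = positive_predictive_value(lambda a, b: a and b)  # 17 of 17

print(f"{ppv_021:.0%} {ppv_015:.0%} {ppv_all:.0%}")
```

These are the values observed in the sample set (65 %, 81 % and 100 %); the slightly lower expectation for the general population discussed above additionally depends on the estimated frequency of trans configurations, which this sketch does not model.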

In a single HE liver none of the above 9 variants that were found in the other 17 HE 
samples could be detected. A closer examination of variants found in this sample revealed 
a variant within intron 4 (ch-v-018) and one within intron 5 (ch-v-019). These 
variants were unique to this sample, since they were not found in any other of the samples 
screened. Neither were they found in any of the other ethnic groups screened. It remains 
to be shown whether these variants are themselves causative for transcriptional activation 
or whether they are linked to another, so far undetected variant. 

Example 4: Determination of CYP3A5 protein expression 

Protein expression of CYP3A4 and CYP3A5 in Caucasian liver samples was determined 
by Western blotting using CYP3A4- and CYP3A5-specific antibodies (Gentest). Liver 
microsomes were prepared as previously described (Zanger, Biochemistry 27 (1988), 
5447-54). To obtain total protein homogenate, powdered liver tissue was homogenised in 
0.1 M Tris-Cl pH 7.4, 1 mM EDTA, 1 mM Pefabloc SC, 1 µg/ml leupeptin, 1 µg/ml 
pepstatin with a Potter Elvehjem homogeniser (glass/Teflon) for 2 min at 1000 rpm. 
Homogenates were then sonicated with a Bandelin Sonoplus HD 200 and stored at -80°C. 
For Western blotting, 12.5 µg microsomal protein homogenate or 40 µg total protein 
homogenate were separated in a 10 % SDS-polyacrylamide gel. Electrophoretic transfer 
onto PVDF membranes was carried out in a TankBlot Cell (BioRad) for 1.5 hours at 
constant voltage (100 V) and at 10 °C. Following the transfer, the membranes were 
incubated for 60 min in 5 % milk, TBS, 0.1 % Tween 20 to reduce unspecific antibody 
binding. Incubations with either primary antibody (Gentest, dilution 1:500) were performed 
in 1 % milk, TBS, 0.1 % Tween 20 for 60 min, those with the secondary antibody 
(anti-rabbit IgG-POD Fab-fragments, Dianova, dilution 1:10000) in the same solution for 30 min. 
CYP3A4 or CYP3A5 protein bands were detected with Supersignal Dura (Pierce) and a 
digital CCD-camera (LAS-1000, Fuji). Signal quantification was performed with AIDA 
(Raytest). Protein expression levels were calculated based on calibration curves obtained 
with microsomes expressing recombinant CYP3A4 and CYP3A5 proteins (Gentest). 

Homogenates or microsomal fractions were prepared from 186 human livers and 
investigated by Western blotting using a CYP3A5-specific antibody. CYP3A5 protein was 
detected in all samples analysed and its expression showed a clear bimodality (Fig. 1A). 
168 livers (~ 90 %), further referred to as LE (low-expressing), showed expression close to 
or below the lower limit of quantification (LLOQ) of the assay (0.3 pmol/mg homogenate 
protein and 1.0 pmol/mg microsomal protein). Eighteen samples (~ 10 %), further referred 
to as HE (high-expressing), exhibited much higher CYP3A5 expression levels. The 
expression was in the range between 1.6 and 2.9 pmol/mg homogenate protein (2.3 ± 0.5; 
n = 6) and between 3.9 and 15.5 pmol/mg microsomal protein (9.7 ± 4.1; n = 12). Taking 
the LLOQ of the assay as the expression level of CYP3A5 in LE livers, HE livers express 
on average 8 to 10 times more CYP3A5 protein than LE livers. 
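
The fold difference quoted above is the ratio of the mean HE expression to the LLOQ of the respective preparation. A short Python sketch of this arithmetic, using only the values stated in this Example (the dictionary names are hypothetical):

```python
# Values taken from the text above (pmol CYP3A5 per mg protein).
LLOQ = {"homogenate": 0.3, "microsomes": 1.0}      # assumed LE expression level
HE_MEAN = {"homogenate": 2.3, "microsomes": 9.7}   # mean expression in HE livers

# Fold difference HE vs. LE for each type of preparation.
fold = {prep: HE_MEAN[prep] / LLOQ[prep] for prep in LLOQ}

for prep, value in fold.items():
    print(f"{prep}: {value:.1f}-fold")
```

Because LE expression is at or below the LLOQ, these ratios (about 7.7 and 9.7) are best read as lower bounds of the true fold difference.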

In the following, the contribution of CYP3A5 to the combined CYP3A5 and CYP3A4 
protein expression in HE livers was investigated. CYP3A4 expression in these livers was 
between 0.9 and 82.6 pmol/mg homogenate protein (n = 6) and between 4.5 and 295 
pmol/mg microsomal protein (n = 11). The levels and the range of CYP3A4 variability in 
HE livers were similar to those in LE livers (not shown). Fig. 1B shows the share of 
CYP3A5 in the combined CYP3A5 and CYP3A4 protein pool in 17 HE livers. CYP3A5 
contribution varies between 3 % and 74 %. In an average HE liver, the share of CYP3A5 in 
the combined pool of both proteins is 24 %. Taking the LLOQ of the CYP3A5 assay as the 
actual expression level of the protein in LE livers, the corresponding value in these livers is 
approximately 1.6 %. 

Example 5: Determination of CYP3A5 mRNA expression 

The expression of CYP3A5 mRNA in 8 Caucasian HE and 8 LE livers was investigated 
using a CYP3A5-specific TaqMan probe. As illustrated in Fig. 2A, the distribution of 
CYP3A5 mRNA levels exhibited a bimodality which was in complete agreement with that 
observed in the expression of CYP3A5 protein (Fisher's exact test, p « 0.001). The 
number of CYP3A5 transcripts per ng of total RNA in HE livers (n = 8) was on average 8.5 
times higher than that in LE livers (n = 8). 
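
To illustrate the Fisher's exact test mentioned above, the following Python sketch computes a two-sided p-value for a 2x2 table. The table itself is an assumption made for illustration: complete agreement between protein class (HE/LE) and mRNA class (high/low) in 8 HE and 8 LE livers, i.e. [[8, 0], [0, 8]]. The function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2

    def prob(x):
        # Probability of observing x in the top-left cell given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


# Assumed table: rows = HE / LE protein class, columns = high / low mRNA class.
p_value = fisher_exact_two_sided(8, 0, 0, 8)
print(f"p = {p_value:.2e}")
```

For this assumed table the p-value is 2/12870, which is consistent with the statement p « 0.001.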

In the following, the allelic origin of CYP3A5 transcripts in HE and LE livers was 
investigated. To this end, a portion of the 3'-UTR (untranslated region) of the gene 
was amplified by PCR and sequenced, using as templates genomic DNA or cDNA samples 
which were heterozygous for a T>C variant located in this region (variant ch-v-015 in 
Table 2A). As expected, both alleles are represented in the sequence using genomic DNAs 
as template (Fig. 2B). Both alleles were also equally represented in a sequence of 
PCR-amplified cDNA from a LE liver (homozygous wildtype for ch-v-021 and ch-v-026, 
heterozygous for ch-v-015). In contrast, only the C allele was found in the same portion 
of CYP3A5 3'-UTR cDNA from a HE liver. This indicates an overrepresentation of transcripts 
derived from the chromosome harbouring the C allele in the total pool of CYP3A5 
transcripts in HE livers. 



Example 6: In vitro mutagenesis and expression of recombinant CYP3A5 proteins 

Five polymorphisms in Caucasians have been detected that lead to changes in the protein 
sequence. Two of them, ch-v-006 and ch-v-017, lead to a truncation of the protein and 
therefore are unlikely to code for a functional protein. As ch-v-005 has only been found 
physically linked to ch-v-006, the resulting protein variant is not likely to be functional 
either. To determine the effect of the protein variants ch-v-009 and ch-v-001, these variants 
were analysed in a heterologous bacterial expression system. 

A modified CYP3A5 cDNA in the prokaryotic expression vector pKK233-2 (Pharmacia) 
was used as starting material for in vitro mutagenesis (Fig. 5). Variants ch-v-009 and 
ch-v-001, respectively, were introduced into the plasmid by in vitro mutagenesis using the 
QuikChange mutagenesis kit (Stratagene). The successful introduction of the mutations 
and the absence of other, undesired mutations was confirmed by sequencing. The original 
plasmid as well as the two mutagenised plasmids were used to transform E. coli TOPP3 
cells, a strain in which optimal expression of CYP3A proteins has previously been 
obtained. A total of 8 separate colonies of each mutant plasmid were chosen for 
expression studies. The bacteria were grown and induced as described in Eiselt et al. 
(Eiselt, Pharmacogenetics 11 (2001), 447-58). Expression was analysed 48 h and 72 h 
after induction with IPTG/5-ALA. Cells were harvested as described in Domanski et al. 
(Domanski, Arch Biochem Biophys 350 (1998), 223-32). The final P450 content was 
measured by reduced carbon monoxide (CO) difference spectra (Omura, J. Biol. Chem. 
239 (1964), 2370-2378). 

Whereas 30 to 50 nmol solubilised CYP3A5 could be recovered per litre culture of the non- 
mutagenised CYP3A5, expression in the two CYP3A5 variants S100Y and T398N was 
determined to be lower than 3 nmol P450 protein per litre culture. In many instances, the 
"P450" peak was shifted to 454 - 458 nm rather than the typical 448 - 450 nm peak 
expected. The low level of expression in mutagenised colonies made any attempts at 
protein purification futile. Previous experiments suggest that expression levels as low as 
those demonstrated by these CYP3A5 variants can not be significantly improved by 
utilising other bacterial strains or adjusting growth temperature. Therefore, the results 
strongly suggest that the CYP3A5 protein variants comprising the S100Y or the T398N 
substitutions are unstable in an E. coli expression system and that the variants comprising 
ch-v-009 or ch-v-001 may not code for functional proteins. The result of negative 
expression for variant ch-v-001 (T398N) is in agreement with the study in which this 
polymorphism was initially detected in two of five CYP3A5 deficient individuals (Jounaidi, 
Biochem Biophys Res Commun 221 (1996), 466-70). 

Example 7: Prediction of expected drug metabolism by CYP3A5 genotypes 

The CYP3A5 protein degrades many drugs by oxidation so that they are no longer 
therapeutically active. Therefore, drugs that are CYP3A5 substrates might not 
reach therapeutically active plasma concentrations for an adequate time span in patients 
with enhanced CYP3A5 activity. In these patients these drugs have to be dosed higher. 
On the other hand, in patients with reduced CYP3A5 activity, these drugs have to be 
administered at lower dosage in order to avoid toxic drug levels. Table 4 gives an 
assignment for CYP3A5 genotypes and recommended dosages. 

In cases in which CYP3A5 metabolism leads to the formation of a pharmacologically active 
substance, enhanced enzyme activity has to be counteracted by reduced dosage whereas 
reduced CYP3A5 activity has to be met by increased dosage. 
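
The dose adjustment factors listed in Table 4 appear to follow a simple proportionality: for drugs degraded by CYP3A5 the factor equals the relative enzyme activity, and for drugs activated by CYP3A5 it equals its reciprocal. The following Python sketch reproduces this arithmetic under that assumption; the function name is hypothetical, and for the 0 % activity class Table 4 lists bounds (<0.05 and >20) which the sketch does not reproduce.

```python
def dose_adjustment_factors(relative_activity):
    """Return (degradation_factor, activation_factor).

    relative_activity -- enzyme activity relative to the reference
                         genotype of Table 4 (1.0 corresponds to 100 %)
    """
    if relative_activity <= 0:
        raise ValueError("Table 4 gives only bounds (<0.05, >20) for 0 % activity")
    degradation_factor = relative_activity        # dose scales with activity
    activation_factor = 1.0 / relative_activity   # dose scales inversely
    return degradation_factor, activation_factor


# Activity classes of Table 4 with non-zero activity.
for activity in (1.90, 1.00, 0.95, 0.10, 0.05):
    degraded, activated = dose_adjustment_factors(activity)
    print(f"{activity:.0%}: degradation x{degraded:.2f}, activation x{activated:.2f}")
```

As noted in the legend of Table 4, such factors may need to be weighted by the share of CYP3A5 in the metabolism of a given drug.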
Table 1A: Primers used to screen for polymorphisms within the CYP3A5 upstream 
regions and the CYP3A5 gene. 



Fragment  Size (bp)  Primer ID  Name  Position (nt)  Ref.  Sequence (5'-3')
chzk                 001        694   145448-145468  1     ACAGGCACAGAAACCCACAAG
                     002        711   146077-146059  1     ATCGCCACTTGCCTTCTTC
chzl      575        003        794   159915-159933  1     CCCTGCTTCGGCTTGTGCA
                     004        750   160489-160466  1     CACAGCCTGCTTTATTTGTCATGA
chzj                 005        751   160351-160371  1     GATCCTTGGTAGGACAAGCCT
                     006        754   161194-161171  1     CAAGCACTGATTTGGTCACTTCCT
chzb      618        007        819   160951-160972  1     GGGATGGGACCGTAAGTGGAAC
                     008        820   161568-161543  1     TAATCACATTGGAGTTCTGACAAATG
chzl      910        009        736   161419-161448  1     AAAAACCTCTTACAAAAGTATCATCGGATA
                     010        737   162328-162302  1     CCTACTAGGTCTCTGACTTGGAACCAT
chzh      637        011        784   161876-161896  1     GCCGAGACGCACCATTACACT
                     012        785   162512-162492  1     CACCCATCCCTTCCCACTCAT
chza      997        013        740   162300-162327  1     TGATGGTTCCAAGTCAGAGACCTAGTAG
                     014        741   163296-163262  1     AATTGTAGACATCTTTCTCTTAAGTTAATTCCCAG
chzf      824        015        786   163182-163206  1     TCTGCATGCCAACAGTGAACAATCT
                     016        789   164005-163987  1     GGCACGCACCAGCATGTCC
chzG      591        017        790   163845-163863  1     CTGGCTGAGTGCCGTGGCT
                     018        791   164435-164411  1     TGAGCGCTTCATGTATTCTGGCTAT
chza      617        019        824   164376-164406  1     AAATATTTTCAAAGTCACACTCTGACAACAG
                     020        822   164992-164964  1     TAACAGGATCTCATGCTTTTTTCATGGCT
chzd                 021        747   164843-164872  1     CACTCCAATATTCACAATAGCCACTATTCA
                     022        748   165768-165745  1     ACTCCTACGTATCCTTCCAAGCCC
chzc      557        023        728   165586-165613  1     GCTAAGGGAAACAGGCATAGAAACTTAC
                     024        729   166142-166125  1     GGAGCTTCCCTGCCCTGC
chzy      424        025        323   166076-166096  1     TCCTTCTCCAGCACATAAATC
                     026        325   166499-166479  1     AAATTAGAAGGTGGATGGGAG
chzx                 027        335   169838-169858  1     GAGTAACTCACCAGCCCTCTG
                     028        336   170101-170081  1     AAACCTCAGAACTCCCTCCCA
chzw      402        029        338   171392-171414  1     GACATCTCTGAATAGCTTCCTTC
                     030        341   171793-171773  1     GCACATAGTTTATAACGGCAA
chzv                 031        346   173303-173325  1     AGAACCTAAGGTTGCTGTGTGTC
                     032        348   173696-173676  1     TGCAAGATGTTACCACTGGGC
chzu      426        033        354   31376-31395    2     CGCCCCACATACACTCAGAA
                     034        357   31801-31781    2     AGACCATTTTTAGGAAGCTCG
chzt      403        035        379   31760-31781    2     CAAGGGGTAGTCCACTGAGTTC
                     036        381   32162-32145    2     CTCTTTGGAGTTGCAGCG
chzs      578        037        362   33081-33101    2     AGGTGAGTCTAACTCAGCTTG
                     038        365   33658-33638    2     GACAGCTAAAGTGTGTGAGGG
chzr                 039        371   34411-34431    2     AATGGGTTCCAGTTGAGAATC
                     040        373   34880-34860    2     ATTGTTGTGCCTGATTTCAAG
chzq                 041        387   35627-35645    2     AGAAGCCATAGGGAGGTTG
                     042        389   36049-36030    2     GACTGTCCTCCAAGCATTCT
chzp      626        043        394   37724-37744    2     GATGCCATGATGAGGAGTGTG
                     044        397   38349-38331    2     ACCAGGGCCAGCAATATTG
chzm      595        045        403   45711-45735    2     AAATACTTCACGAATACTATGATCA
                     046        405   46305-46282    2     CAGGGACATAATTGATTATCTTTG
chzo      456        047        411   48290-48308    2     TACTGGTTGGGAGGTGGAG
                     048        412   48745-48723    2     CATGATGTTCTTAATGCTACAGG
chzn      416        049        419   50088-50112    2     GAAGAGTTCAAGATACATGGTGTTA
                     050        420   50503-50481    2     TGCACAACACTCTACACAGACTC

Ref.: Reference sequence. The positions of primers refer to GenBank sequences 
with accession numbers (1) AF280107.1 and (2) AC005020.2. 
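
The PCR fragment lengths in Table 1A follow from the primer coordinates: the fragment spans from the 5' end of the forward primer to the 5' end of the reverse primer, inclusive. A minimal Python sketch of this calculation, using primer positions as listed in Table 1A (the function and dictionary names are hypothetical):

```python
def amplicon_length(forward_pos, reverse_pos):
    """Length in bp of the fragment delimited by a primer pair.

    forward_pos, reverse_pos -- (first, last) coordinates as listed in
    Table 1A; reverse primers are listed from high to low position.
    """
    coords = (*forward_pos, *reverse_pos)
    return max(coords) - min(coords) + 1


# Primer pairs with positions copied from Table 1A.
PRIMER_PAIRS = {
    "chza": ((162300, 162327), (163296, 163262)),
    "chzx": ((169838, 169858), (170101, 170081)),
    "chzp": ((37724, 37744), (38349, 38331)),
    "chzm": ((45711, 45735), (46305, 46282)),
    "chzo": ((48290, 48308), (48745, 48723)),
    "chzn": ((50088, 50112), (50503, 50481)),
}

lengths = {name: amplicon_length(fwd, rev) for name, (fwd, rev) in PRIMER_PAIRS.items()}
print(lengths)
```

The results for chzp, chzm, chzo and chzn (626, 595, 456 and 416 bp) match the fragment sizes listed in the table, and chzx and chza give the 264 bp and 997 bp limits of the fragment size range stated in Example 2.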
Table 1B: Primers (Seq IDs 202 and 203) and probes (Seq IDs 204 and 205) used to 
determine the nucleotide status at the polymorphic site ch-v-048 (g.6986G>A) by 
TaqMan assay. 




Fragment  Size (bp)  Primer ID  Name                     Position (nt)  Sequence (5'-3')
chyu      99         202        TQPLch-v-048_F           173152-173184  GCTCTACTGTCATTTCTAACCATAATCTCTTTA
                     204        TQPo_ch-v-048_Al1_G_VIC  173196-173213  VIC-TGTCTTTCAGTATCTCTT-MGB-DQ
                     205        TQPo_ch-v-048_Al2_A_FAM  173196-173214  FAM-TGTCTTTCAATATCTCTTC-MGB-DQ
                     203        TQPLch-v-048_R           173250-173224  GCTTCATATGATGAAGGGTAATGTGGT

Ref.: Reference sequence. The positions of the primers refer to the GenBank 
sequence with accession number AF280107.1. Probes are labelled with a 
fluorescent dye at the 5' end and carry a dark quencher (DQ) and a 
minor groove binder (MGB). 
Table 3: CYP3A5 genotypes and phenotypes. 





            ch-v-021  ch-v-026  ch-v-015  Phenotype  Livers
Genotype 1  A/A       A/A       T/T       LE         155
Genotype 2  A/G       A/G       T/T       LE         9
Genotype 3  A/A       A/A       T/C       LE         4
Genotype 4  A/G       A/G       T/C       HE         17
Genotype 5  A/A       A/A       T/T       HE         1

All three variants were observed only in the heterozygous state. HE = high 
expressing livers, LE = low expressing livers. Numbers indicate LE and HE livers 
with each particular genotype. The increased CYP3A5 expression co-segregates 
with a distinct genotype. 
Table 4: Expected drug metabolism by CYP3A5 



Genotype     Allelic combination                            Enzyme    Dose adjustment
No.                                                         activity  drug degradation  drug activation
I            high expressor allele / high expressor allele  190 %     1.90              0.53
II, III      low expressor allele / high expressor allele   100 %     1.00              1.00
IV - VII     null allele / high expressor allele            95 %      0.95              1.05
VIII, IX, X  low expressor allele / low expressor allele    10 %      0.10              10
XI - XVIII   null allele / low expressor allele             5 %       0.05              20
XIX - XXV    null allele / null allele                      0 %       <0.05             >20

Genotype  Genotype (Seq ID) at locus         Enzyme    Dose adjustment
No.       1-2-3-4-5-6-7-8                    activity  drug degradation  drug activation
I         060-062-079-081-087-111-103-108 /  190 %     1.90              0.53
          060-062-079-081-087-111-103-108
II        0xx-06x-079-081-087-111-103-107 /  100 %     1.00              1.00
          060-062-079-081-087-111-103-108
III       059-061-079-081-087-111-103-10x /  100 %     1.00              1.00
          060-062-079-081-087-111-103-108
IV        0xx-06x-080-082-08x-11x-10x-10x /  95 %      0.95              1.05
          060-062-079-081-087-111-103-108
V         0xx-06x-0xx-08x-088-11x-10x-10x /  95 %      0.95              1.05
          060-062-079-081-087-111-103-108
VI        0xx-06x-0xx-08x-08x-112-10x-10x /  95 %      0.95              1.05
          060-062-079-081-087-111-103-108
VII       0xx-06x-0xx-08x-08x-11x-104-10x /  95 %      0.95              1.05
          060-062-079-081-087-111-103-108
VIII      060-062-079-081-087-111-103-107 /  10 %      0.10              10
          059-061-079-081-087-111-103-108
IX        0xx-06x-079-081-087-111-103-107 /  10 %      0.10              10
          0xx-06x-079-081-087-111-103-107
X         059-061-079-081-087-111-103-10x /  10 %      0.10              10
          059-061-079-081-087-111-103-10x
XI        059-061-079-081-087-111-103-10x /  5 %       0.05              20
          0xx-06x-080-082-08x-11x-10x-10x
XII       0xx-06x-079-081-087-111-103-107 /  5 %       0.05              20
          0xx-06x-080-082-08x-11x-10x-10x
XIII      059-061-079-081-087-111-103-10x /  5 %       0.05              20
          0xx-06x-0xx-08x-088-11x-10x-10x
XIV       0xx-06x-079-081-087-111-103-107 /  5 %       0.05              20
          0xx-06x-0xx-08x-088-11x-10x-10x
XV        059-061-079-081-087-111-103-10x /  5 %       0.05              20
          0xx-06x-0xx-08x-08x-112-10x-10x
XVI       0xx-06x-079-081-087-111-103-107 /  5 %       0.05              20
          0xx-06x-0xx-08x-08x-112-10x-10x
XVII      059-061-079-081-087-111-103-10x /  5 %       0.05              20
          0xx-06x-0xx-08x-08x-11x-104-10x
XVIII     0xx-06x-079-081-087-111-103-107 /  5 %       0.05              20
          0xx-06x-0xx-08x-08x-11x-104-10x
XIX       0xx-06x-080-082-08x-11x-10x-10x /  0 %       <0.05             >20
          0xx-06x-0xx-08x-088-11x-10x-10x
XX        0xx-06x-080-082-08x-11x-10x-10x /  0 %       <0.05             >20
          0xx-06x-0xx-08x-08x-112-10x-10x
XXI       0xx-06x-080-082-08x-11x-10x-10x /  0 %       <0.05             >20
          0xx-06x-0xx-08x-08x-11x-104-10x
XXII      0xx-06x-080-082-08x-11x-10x-10x /  0 %       <0.05             >20
          0xx-06x-080-082-08x-11x-10x-10x
XXIII     0xx-06x-0xx-08x-088-11x-10x-10x /  0 %       <0.05             >20
          0xx-06x-0xx-08x-088-11x-10x-10x
XXIV      0xx-06x-0xx-08x-08x-112-10x-10x /  0 %       <0.05             >20
          0xx-06x-0xx-08x-08x-112-10x-10x
XXV       0xx-06x-0xx-08x-08x-11x-104-10x /  0 %       <0.05             >20
          0xx-06x-0xx-08x-08x-11x-104-10x

No.: running genotype number. 

Genotype: Possible CYP3A5 genotypes that result from combinations of alleles. At 
the top of the table concise allele names have been used to indicate the principle. 
The lower table lists the alleles in greater detail, giving all combinations of variants at 
8 loci in the two homologous chromosomes (loci 1 - 8 refer to positions corresponding 
to positions -20291, -6177, 3705, 3709/3710, 7303, 27131/27132, 27289 and 31611, 
respectively, of the CYP3A5 gene (Accession No: AF280107.1, wherein position 
166220 has been numbered +1 and position 174832 has been numbered +8613)). 
Each variant is defined by a 3-digit Seq ID as listed in Table 2A-E. A 
wildcard (x) in Seq IDs indicates that the phenotype is independent of the variant 
at this locus in this chromosome. The possible variants for each locus that can be 
substituted for x can be extracted from Table 2A-E. For example, 08x at locus 4 
stands for Seq IDs 081 or 082, whereas 08x at locus 5 indicates either Seq ID 087 or 
088. 

Enzyme Activity: enzyme activity as calculated from protein concentration whereby 
the average protein concentration of genotype 059-061-079-081-111-107/060-062- 
079-081-111-108 was defined as 100 %. 

Dose Adjustment: dose adjustment factors for drugs that are degraded/activated by 
CYP3A5 relative to the dose required for genotype 059-061-079-081-111-107/060- 
062-079-081-111-108. Factors may need to be weighted according to the activity 
share of the CYP3A5 enzyme for drugs which are not exclusively metabolised by 
CYP3A5. 
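
The relative CYP3A5 numbering used for the variants can be converted back to GenBank coordinates from the anchor points stated in this document (AF280107.1 position 166220 = +1 and position 174832 = +8613; AC005020.2 position 27341 = +8614). The Python sketch below illustrates this conversion. Two points are assumptions rather than statements of the source: that there is no position 0 (so that -1 directly precedes +1), and that both GenBank records run in the same orientation as the gene. The function name is hypothetical.

```python
AF_ACCESSION = "AF280107.1"
AC_ACCESSION = "AC005020.2"
AF_PLUS_ONE = 166220     # AF280107.1 position numbered +1
AF_LAST_RELATIVE = 8613  # relative number of AF280107.1 position 174832
AC_FIRST = 27341         # AC005020.2 position numbered +8614
AC_FIRST_RELATIVE = 8614


def to_genbank(relative_position):
    """Map a relative CYP3A5 position to (accession, GenBank position)."""
    if relative_position == 0:
        raise ValueError("position 0 is not defined (assumed numbering without zero)")
    if relative_position < 0:
        # Assumption: -1 is the nucleotide immediately upstream of +1.
        return AF_ACCESSION, AF_PLUS_ONE + relative_position
    if relative_position <= AF_LAST_RELATIVE:
        return AF_ACCESSION, AF_PLUS_ONE + relative_position - 1
    return AC_ACCESSION, AC_FIRST + relative_position - AC_FIRST_RELATIVE


print(to_genbank(6986))    # ch-v-048 (g.6986G>A)
print(to_genbank(31611))   # locus 8 of Table 4
```

As a plausibility check, position 6986 maps to AF280107.1 position 173205, which lies within the TaqMan probe positions given in Table 1B (173196-173214), and position 31611 maps to AC005020.2 position 50338, which lies within fragment chzn of Table 1A (50088-50503).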
CLAIMS 

1. A polynucleotide comprising a polynucleotide selected from the group 
consisting of: 

(a) a polynucleotide having the nucleic acid sequence of SEQ ID NO: 54, 

56, 58, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 
96, 98, 100, 102, 106, 108, 110, 112, 113, 114, 115, 116, 117, 118, 
119, 120, 121, 122, 123, 124, 125, 126, 128, 129, 130, 131, 133, 134, 
135, 136, 137, 138, 139, 140, 142, 143, 149, 151, 153, 155, 157, 159, 
161, 163, 165, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 
193, 195, 197, 199. 201, 207, 208, 209, 210, 211, 212, 213, 214, 216, 
218, 219, 220, 221, 222, 223, 224. 225. 226, 227, 228, 231, 232, 233, 
235, or 236; 

(b) a polynucleotide encoding a polypeptide having the amino acid 
sequence of SEQ ID NO: 127, 132, 141 , 215, 229, or 234; 

(c) a polynucleotide capable of hybridizing to a CYP3A5 gene, wherein said 
polynucleotide is having a nucleotide exchange, a nucleotide deletion of 
at least one nucleotide, or at least one additional nucleotide at a 
position corresponding to position -20643, -20555, -20359, -20367, 
-20329, -20323, -20310, -6200, -6177, -4336, -3990, -3868, -3844, 
-3557, -1617, -795, -86, -74, 136, 174 to 176, 230, 3705, 3709/3710, 
5215, 5235, 5516, 7182, 7207, 7303, 7424/7427, 12907, 13028, 13077, 

■13173, 13226, 13376, 14720, 14836, 14903, 15788, 16079, 
16931/16932, 16993, 17163, 19069, 19165, 19208, 27050, 
27131/27132, 27526, 31499, 31551 or 31611; 

(d) a polynucleotide capable of hybridizing to a CYP3A5 gene, wherein said 
polynucleotide is having an A at a position corresponding to position 
-20555, -20329, -20323, -4336, -3868, -3844, -795, -86, 230, 5235, 
5516, 7182, 7303, 12907, 13028, 13376, 19069 or 19165 of the 
CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 
has been numbered +1 and position 174832 has been numbered 
+8613, and Accession No: AC005020.2, wherein position 27341 has 
been numbered +8614), a T at a position corresponding to position 
-20367, -6200, -74, 3705, 5215, 7207, 14836, 17163, 19208 or 27526 of 
the CYP3A5 gene (Accession No: AF280107.1, wherein position 
166220 has been numbered +1 and position 174832 has been 
numbered +8613, and Accession No: AC005020.2, wherein position 
27341 has been numbered +8614), a G at a position corresponding to 
position -6177, -3990, 13077, 14720, 14903, 16993 or 27050 of the 
CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 
has been numbered +1 and position 174832 has been numbered 
+8613, and Accession No: AC005020.2, wherein position 27341 has 
been numbered +8614), a C at a position corresponding to position 
-20643, -20310, -3557, -1617, 136, 13173, 13226, 15788, 16079, 
31499, 31551 or 31611 of the CYP3A5 gene (Accession No: 
AF280107.1, wherein position 166220 has been numbered +1 and 
position 174832 has been numbered +8613, and Accession No: 
AC005020.2, wherein position 27341 has been numbered +8614), 
nucleotide deletions at positions corresponding to positions 174 to 176 
of the CYP3A5 gene (Accession No: AF280107.1, wherein position 
166220 has been numbered +1 and position 174832 has been 
numbered +8613), an additional nucleotide at a position corresponding 
to position 3709/3710 or 27131/27132 of the CYP3A5 gene (Accession 
No: AF280107.1, wherein position 166220 has been numbered +1 and 
position 174832 has been numbered +8613, and Accession No: 
AC005020.2, wherein position 27341 has been numbered +8614), three 
additional nucleotides at a position corresponding to position 
16931/16932 of the CYP3A5 gene (Accession No: AF280107.1, 
wherein position 166220 has been numbered +1 and position 174832 
has been numbered +8613, and Accession No: AC005020.2, wherein 
position 27341 has been numbered +8614), or a deletion of two 
nucleotides and nine additional nucleotides inserted at a position 
corresponding to position 7424 to 7427 of the CYP3A5 gene (Accession 
No: AF280107.1, wherein position 166220 has been numbered +1 and 
position 174832 has been numbered +8613); 
(e) a polynucleotide encoding a CYP3A5 polypeptide or fragment thereof, 
wherein said polypeptide comprises an amino acid substitution at a 
position corresponding to position 30, 100, 130, 149 or 488 of the 
CYP3A5 polypeptide (Accession No: NP_000768.1), or at least one 
amino acid exchange or a stop codon at a position corresponding to 
position 30 to 34 or 346 to 348 of the CYP3A5 polypeptide (Accession 
No: NP_000768.1); and 
(f) a polynucleotide encoding a CYP3A5 polypeptide or fragment thereof, 
wherein said polypeptide comprises amino acid substitutions of HGLFK 
to YGTF. (with the period meaning termination) at a position 
corresponding to position 30 to 34 of the CYP3A5 polypeptide 
(Accession No: NP_000768.1), an amino acid substitution of S to Y at a 
position corresponding to position 100 of the CYP3A5 polypeptide 
(Accession No: NP_000768.1), an amino acid substitution of R to Q at a 
position corresponding to position 130 of the CYP3A5 polypeptide 
(Accession No: NP_000768.1), an amino acid substitution of I to T at a 
position corresponding to position 149 of the CYP3A5 polypeptide 
(Accession No: NP_000768.1), amino acid substitutions of TYD to 
YL. (with the period meaning termination) at a position corresponding to 
position 346 to 348 of the CYP3A5 polypeptide (Accession No: 
NP_000768.1), or an amino acid substitution of I to T at a position 
corresponding to position 488 of the CYP3A5 polypeptide (Accession 
No: NP_000768.1). 
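
The position numbering used throughout claim 1 can be sketched in a few lines of Python. Only the three anchor points are taken from the claim (AF280107.1 position 166220 numbered +1, position 174832 numbered +8613, and AC005020.2 position 27341 numbered +8614). Two things are assumptions made for this sketch: that there is no position 0, so that the nucleotide immediately upstream of +1 is -1, and that the numbering continues along ascending AC005020.2 coordinates. The function and constant names are hypothetical.

```python
from typing import Tuple

# Anchor points stated in claim 1 (accession coordinate -> gene-relative number)
AF280107_PLUS1 = 166220   # AF280107.1 position numbered +1
AF280107_LAST = 8613      # gene-relative number of AF280107.1 position 174832
AC005020_FIRST = 27341    # AC005020.2 position numbered +8614


def gene_to_accession(n: int) -> Tuple[str, int]:
    """Map a gene-relative position to (accession, coordinate).

    Assumptions: no position 0 exists, and AC005020.2 coordinates
    ascend together with the gene-relative numbering.
    """
    if n == 0:
        raise ValueError("position 0 is not defined under the assumed numbering")
    if n < 0:
        # upstream positions: -1 is the nucleotide directly before +1
        return ("AF280107.1", AF280107_PLUS1 + n)
    if n <= AF280107_LAST:
        return ("AF280107.1", AF280107_PLUS1 + n - 1)
    return ("AC005020.2", AC005020_FIRST + n - (AF280107_LAST + 1))


def af280107_to_gene(coordinate: int) -> int:
    """Inverse mapping for AF280107.1 coordinates (same assumptions)."""
    offset = coordinate - AF280107_PLUS1
    return offset + 1 if offset >= 0 else offset
```

With these assumptions the stated anchors are reproduced exactly: coordinate 174832 maps to +8613, and gene-relative position 8614 maps to AC005020.2 coordinate 27341.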

2. A polynucleotide of claim 1, wherein said polynucleotide is associated with 
cancer or diseases including cardiovascular diseases, diabetes and AIDS. 

3. A polynucleotide of any one of claims 1 to 2 which is DNA or RNA. 

4. A gene comprising the polynucleotide of any one of claims 1 to 2. 

5. The gene of claim 4 wherein a nucleotide deletion, addition and/or substitution 
results in altered expression of the variant gene compared to the 
corresponding wild type gene. 

6. A vector comprising a polynucleotide of any one of claims 1 to 3 or the gene of 
claim 4 or 5. 

7. The vector of claim 6, wherein the polynucleotide is operatively linked to 
expression control sequences allowing expression in prokaryotic or eukaryotic 
cells or isolated fractions thereof. 

8. A host cell genetically engineered with the polynucleotide of any one of claims 
1 to 3, the gene of claim 4 or 5 or the vector of claim 6 or 7. 

9. A method for producing a molecular variant CYP3A5 polypeptide or fragment 
thereof comprising 

(a) culturing the host cell of claim 8; and 

(b) recovering said protein or fragment from the culture. 

10. A method for producing cells capable of expressing a molecular variant 
CYP3A5 polypeptide comprising genetically engineering cells with the 
polynucleotide of any one of claims 1 to 3, the gene of claim 4 or 5 or the 
vector of claim 6 or 7. 

11. A polypeptide or fragment thereof encoded by the polynucleotide of any one of 
claims 1 to 3, the gene of claim 4 or 5 or obtainable by the method of claim 9 
or from cells produced by the method of claim 10. 

12. An antibody which binds specifically to the polypeptide of claim 11. 

13. The antibody of claim 12 which specifically recognizes an epitope containing 
one or more amino acid substitution(s) resulting from a nucleotide exchange 
as defined in claim 1 or 5. 

14. The antibody of claim 12 or 13 which is monoclonal or polyclonal. 

15. A transgenic non-human animal comprising at least one polynucleotide of any 
one of claims 1 to 4, the gene of claim 5 or 6 or the vector of claim 7 or 8. 

16. The transgenic non-human animal of claim 15 which is a mouse, a rat or a 
zebrafish. 

17. A solid support comprising one or a plurality of the polynucleotide of any one 
of claims 1 to 3, the gene of claim 4 or 5, the vector of claim 6 or 7, the 
polypeptide of claim 11, the antibody of claim 12 or 13 or the host cell of claim 
8 in immobilized form. 

18. The solid support of claim 17, wherein said solid support is a membrane, a 
glass-, polypropylene- or silicon-chip, oligonucleotide-conjugated beads or a 
bead array which is assembled on an optical filter substrate. 

19. An in vitro method for identifying a polymorphism said method comprising the 
steps of: 

(a) isolating a polynucleotide of any one of claims 1 to 3 or the gene of claim 
4 or 5 from a plurality of subgroups of individuals, wherein one 
subgroup has no prevalence for a CYP3A5 associated disease and at 
least one or more further subgroup(s) do have prevalence for a 
CYP3A5 associated disease; and 

(b) identifying a polymorphism by comparing the nucleic acid sequence of 
said polynucleotide or said gene of said one subgroup having no 
prevalence for a CYP3A5 associated disease with said at least one or 
more further subgroup(s) having a prevalence for a CYP3A5 associated 
disease. 

20. A method for identifying and obtaining a pro-drug or a drug capable of 
modulating the activity of a molecular variant of a CYP3A5 polypeptide 
comprising the steps of: 

(a) contacting the polypeptide of claim 11, the solid support of claim 17 or 
18, a cell expressing a molecular variant gene comprising a 
polynucleotide of any one of claims 1 to 3, the gene of claim 4 or 5 or 
the vector of claim 6 or 7 in the presence of components capable of 
providing a detectable signal in response to drug activity with a 
compound to be screened for pro-drug or drug activity; and 

(b) detecting the presence or absence of a signal or increase or decrease 
of a signal generated from the pro-drug or the drug activity, wherein the 
absence, presence, increase or decrease of the signal is indicative for a 
putative pro-drug or drug. 

21. A method for identifying and obtaining an inhibitor of the activity of a molecular 
variant of a CYP3A5 polypeptide comprising the steps of: 

(a) contacting the protein of claim 11, the solid support of claim 17 or 18 or 
a cell expressing a molecular variant gene comprising a polynucleotide 
of any one of claims 1 to 3 or the gene of claim 4 or 5 or the vector of 
claim 6 or 7 in the presence of components capable of providing a 
detectable signal in response to drug activity with a compound to be 
screened for inhibiting activity; and 

(b) detecting the presence or absence of a signal or increase or decrease 
of a signal generated from the inhibiting activity, wherein the absence or 
decrease of the signal is indicative for a putative inhibitor. 

22. The method of claim 20 or 21, wherein said cell is a cell of claim 9, obtained by 
the method of claim 10 or can be obtained by the transgenic non-human 
animal of claim 15 or 16. 

23. A method of identifying and obtaining a pro-drug or drug capable of modulating 
the activity of a molecular variant of a CYP3A5 polypeptide comprising the 
steps of: 

(a) contacting the host cell of claim 8, the cell obtained by the method of 
claim 10, the polypeptide of claim 11 or the solid support of claim 17 or 
18 with the first molecule known to be bound by a CYP3A5 polypeptide 
to form a first complex of said polypeptide and said first molecule; 

(b) contacting said first complex with a compound to be screened, and 

(c) measuring whether said compound displaces said first molecule from 
said first complex. 

24. A method of identifying and obtaining an inhibitor of the activity of a 
molecular variant of a CYP3A5 polypeptide or its gene product comprising the 
steps of: 

(a) contacting the host cell of claim 8, the cell obtained by the method of 
claim 10, the protein of claim 11 or the solid support of claim 17 or 18 
with the first molecule known to be bound by a CYP3A5 polypeptide to 
form a first complex of said protein and said first molecule; 

(b) contacting said first complex with a compound to be screened, and 

(c) measuring whether said compound displaces said first molecule from 
said first complex. 

25. The method of claim 23 or 24, wherein said measuring step comprises 
measuring the formation of a second complex of said protein and said 
compound. 

26. The method of any one of claims 23 to 25, wherein said measuring step 
comprises measuring the amount of said first molecule that is not bound to 
said protein. 

27. The method of any one of claims 23 to 26, wherein said first molecule is 
labeled. 

28. A method for the production of a pharmaceutical composition comprising the 
steps of the method of any one of claims 20 to 27; and the further step of 
formulating the compound identified and obtained or a derivative thereof in a 
pharmaceutically acceptable form. 

29. A method of diagnosing a disorder related to the presence of a molecular 
variant of a CYP3A5 gene or susceptibility to such a disorder comprising 
determining the presence of a polynucleotide of any one of claims 1 to 3 or the 
gene of claim 4 or 5 in a sample from a subject. 

30. The method of claim 29 further comprising determining the presence of a 
polypeptide of claim 11 or the antibody of any one of claims 12 to 14. 

31. A method of diagnosing a disorder related to the presence of a molecular 
variant of a CYP3A5 gene or susceptibility to such a disorder comprising 
determining the presence of a polypeptide of claim 11 or the antibody of any 
one of claims 12 to 14 in a sample from a subject. 

32. The method of any one of claims 29 to 31, wherein said disorder is cancer or 
diseases including cardiovascular diseases, diabetes and AIDS. 

33. The method of any one of claims 29 to 32 comprising PCR, ligase chain 
reaction, restriction digestion, direct sequencing, nucleic acid amplification 
techniques, hybridization techniques, mass spectroscopy or immunoassays. 

34. A method of detection of the polynucleotide of any one of claims 1 to 3 or the 
gene of claim 4 or 5 in a sample comprising the steps of 

(a) contacting the solid support of claim 17 or 18 with the sample under 
conditions allowing interaction of the polynucleotide of any one of claims 1 to 3 or 
the gene of claim 4 or 5 with the immobilized targets on a solid support; 
and 

(b) determining the binding of said polynucleotide or said gene to said 
immobilized targets on a solid support. 

35. An in vitro method for diagnosing a disease comprising the steps of the 
method of claim 34, wherein binding of said polynucleotide or gene to said 
immobilized targets on said solid support is indicative for the presence or the 
absence of said disease or a prevalence for said disease. 

36. A diagnostic composition comprising the polynucleotide of any one of claims 1 
to 4, the gene of claim 3 or 4, the vector of claim 6 or 7, the polypeptide of 
claim 11 or the antibody of claim 12 or 13. 

37. A pharmaceutical composition comprising the polynucleotide of any one of 
claims 1 to 3, the gene of claim 4 or 5, the vector of claim 6 or 7, the 
polypeptide of claim 11 or the antibody of claim 12 or 13. 

38. Use of the polynucleotide of any one of claims 1 to 3, a polynucleotide 
comprising SEQ ID No: 104, a polynucleotide encoding a polypeptide 
comprising SEQ ID No: 145, the gene of claim 4 or 5, the vector of claim 6 or 
7, the polypeptide of claim 11, a polypeptide comprising SEQ ID No: 145 or 
the antibody of claim 12 or 13 for the preparation of a diagnostic composition 
for diagnosing a disease. 

39. Use of the polynucleotide of any one of claims 1 to 3, a polynucleotide 
comprising SEQ ID No: 104, a polynucleotide encoding a polypeptide 
comprising SEQ ID No: 145, the gene of claim 4 or 5, the vector of claim 6 or 
7, the polypeptide of claim 11, a polypeptide comprising SEQ ID No: 145 or 
the antibody of claim 12 or 13 for the preparation of a pharmaceutical 
composition for treating a disease. 

40. Use of a polynucleotide selected from the group consisting of: 

(a) a polynucleotide having the nucleic acid sequence of SEQ ID NO: 82, 
88, 104 or 112; 

(b) a polynucleotide encoding a polypeptide having the amino acid 
sequence of SEQ ID No: 127, 132, 141 or 145; 

(c) a polynucleotide capable of hybridizing to a CYP3A5 gene, wherein 
said polynucleotide is having at least one additional nucleotide at a 
position corresponding to position 3709/3710 or 27131/27132 of the 
CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 
has been numbered +1 and position 174832 has been numbered 
+8613, and Accession No: AC005020.2, wherein position 27341 has 
been numbered +8614) or a nucleotide exchange at a position 
corresponding to position 7303 or 27289 of the CYP3A5 gene 
(Accession No: AF280107.1, wherein position 166220 has been 
numbered +1 and position 174832 has been numbered +8613, and 
Accession No: AC005020.2, wherein position 27341 has been 
numbered +8614); 

(d) a polynucleotide capable of hybridizing to a CYP3A5 gene, wherein 
said polynucleotide is having an additional G nucleotide at a position 
corresponding to position 3709/3710 of the CYP3A5 gene (Accession 
No: AF280107.1, wherein position 166220 has been numbered +1 and 
position 174832 has been numbered +8613, and Accession No: 
AC005020.2, wherein position 27341 has been numbered +8614), an 
additional T nucleotide at a position corresponding to position 
27131/27132 of the CYP3A5 gene (Accession No: AF280107.1, 
wherein position 166220 has been numbered +1 and position 174832 
has been numbered +8613, and Accession No: AC005020.2, wherein 
position 27341 has been numbered +8614), or an A at a position 
corresponding to position 7303 or 27289 of the CYP3A5 gene 
(Accession No: AF280107.1, wherein position 166220 has been 
numbered +1 and position 174832 has been numbered +8613, and 
Accession No: AC005020.2, wherein position 27341 has been 
numbered +8614); 

for the preparation of a diagnostic composition for diagnosing a disease in a 
subject having a genome comprising a variant allele of the CYP3A5 gene, 
wherein said allele is having an A at a position corresponding to position 6986 
of the CYP3A5 gene (Accession No: AF280107.1, wherein position 166220 
has been numbered +1 and position 174832 has been numbered +8613, and 
Accession No: AC005020.2, wherein position 27341 has been numbered 
+8614). 

41. Use of a polynucleotide comprising a polynucleotide having an A at a position 
corresponding to position 14690 of the CYP3A5 gene (Accession No: 
AF280107.1, wherein position 166220 has been numbered +1 and position 
174832 has been numbered +8613, and Accession No: AC005020.2, wherein 
position 27341 has been numbered +8614) for the preparation of a diagnostic 
composition for diagnosing a disease in a subject having a genome comprising 
a variant allele of the CYP3A5 gene, wherein said allele is having an A at a 
position corresponding to position 6986 of the CYP3A5 gene (Accession No: 
AF280107.1, wherein position 166220 has been numbered +1 and position 
174832 has been numbered +8613, and Accession No: AC005020.2, wherein 
position 27341 has been numbered +8614). 

42. The use of claim 40 or 41, wherein said subject is an African American. 

43. The use of any one of claims 38 to 41, wherein said disease is cancer or 
diseases including cardiovascular diseases, diabetes and AIDS. 

44. A diagnostic kit for detection of a single nucleotide polymorphism comprising 
the polynucleotide of any one of claims 1 to 3, the gene of claim 4 or 5, the 
vector of claim 6 or 7, the polypeptide of claim 11, the antibody of claim 12 or 
13, the host cell of claim 8, the transgenic non-human animal of claim 15 or 16 
or the solid support of claim 17 or 18. 

Figure 4 



Seq ID: 113 

>chzk_ch-v-020_254T>G_ch-v-021_582A>G 

1 tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 
61 ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 
121 gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 
181 gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 
241 tgtctggctg ggcgtgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 
301 gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 
361 attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 
421 atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 
481 aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 
541 ggtgtgtgcg attctttgct actggctgca gctgcagccc cgcctccttc tccagcacat 
601 aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 
661 gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt t gaagaaggc 
721 aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 
781 gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 

Seq ID: 114 
>chzk_ch-v-031_318G>A 

1 tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 
61 ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 
121 gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagaa 
181 gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 
241 tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 
301 gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 
361 attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 
421 atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 
481 aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 
541 ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat 
601 aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 
661 gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt t gaagaaggc 
721 aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 
781 gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 

Seq ID: 115 
>chzk_ch-v-032_544G>A 

1 tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 
61 ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 
121 gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 
181 gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 
241 tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 
301 gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 
361 attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 
421 atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 
481 aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 
541 ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat 
601 aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 
661 gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt t gaagaaggc 
721 aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 
781 gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 

Seq ID: 116 
>chzk_ch-v-033_550G>A 

1 tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 
61 ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 
121 gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 
181 gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 
241 tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 
301 gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 
361 attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 
421 atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 
481 aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 
541 ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat 
601 aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 
661 gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt t gaagaaggc 
721 aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 
781 gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 

Seq ID: 117 

>chzl_ch-v-026_229A>G 

1 ggagtgacct gattttccag gtgctgtctg tcaccccttt ctttgactag gaaagggaac 
61 tccttgaccc cttgcgcttc tcaagtgagg caatgcctcg ccctgcttcg gcttgtgca c 
121 agcacgctgc acccactgtc ctgcacccac tgtctggcac tccctagtga gatgaacccg 
181 gtacctcaga tgaaaatgca gaaatcaccc gtcttctgtg tcactcacgc tgggagctgt 
241 agaccggagc tgttcctatt cggccatctt ggctccaccc cccgagtttt ggcttttaat 
301 tgaaagtgta ttgatgtggg aaggagataa tgccatgcat ttatgagcac atattagagg 
361 gtctgagaca atgcatgtga taaaaggtca cctaaggaag aaaaaaagaa caagggaaga 
421 cactggaaag aacgtgatgc tgggagtccc tgggccacca aagtctggag aaaagtggta 
481 accacaaggc tcccagccta gtttcactga agacctcgac actaggtgac ttatgggatc 
541 cttggtagga caagccttga aaagttttac aatagcaaat gtggacgttg tcagaaccaa 
601 atgatgtcac gtgtgtattt gtgtgtgtgt gtcagtgtgt gtgtttaaaa atcatgacaa 
661 ataaagcagg ctgtg aagag gggattccca tgctcgtgtg cctgataaca caactatcac 
721 aaacgctttg cgaaaccaca agtttgcaca aaggcaatcc caaccttaca caaaa 

Seq ID: 118 

>chzi_ch-v-027_566G>A 

1 taatgaaaat aagaattatt ttgatggctc taacagtgac atttatatca tctgttttat 
61 ctggagcatt ctataataag tttatattaa gcaaatcaat aaaaacctct tacaaaagta 
121 tcatcggata ctttcctgaa cattaaggag aaatctatag aactgaatga atgagaacca 
181 acaagtaaat atatgtgatc attgtaacca ttgttggtgt ggggcatttg tcagaactcc 
241 aatgtgatta ttaacatagg tgagaattaa tccactgtga ctttgcccat tgcttagaaa 
301 gaacattcat agtttaatta tgcctttttt gaccaagcac agtggctcat gcctgtaatc 
361 ccagcact tt gggaggccga ggtgggtgga tcacctgagg tcaggagttc gagaccagcc 
421 tgaccaacat ggtgaaaccc catctctact aaaaatacaa aaattagcta ggtgtggtgg 
481 tatgcaccta taatctcagc tacccaggag gctgaggcag gagaatcact tgaacctgga 
541 ggcagaggtt gcagtgagcc gagacacacc attacactcc agcctgggtg acagagtgag 
601 attccatctc aaaaaaaaaa aaaaaaaatt atgccttttt gaagcacata cattttataa 
661 catacaactg aatcccttat tatattatta gttttgattt aatgttttca aaccatctcc 
721 cctgatattt ctgggagatg ggaaacatgt tttcttacac ctcttgcatt ccattctcaa 
781 ctcccaactg tcttactgca atgaacactt aataagaaac agtcaattgg tcaattgatt 
841 gggcaacagg ctaaacacac tcattccttg tctgttccca cttctttctt tactttccct 
901 tcctgagtaa cttatcctaa agtcattagg tgggtggcag ccagatggtg gccacacatt 
961 aaggtagaaa agagagtgtc at gatggttc caagtcagag acctagtagg gtgaggatca 

1021 agtaggtgtt cacgtggaga aacagcccgg cctgtgtgtg ggagtccaag caagcagaga 

1081 aaatgtcgac acagaggggt ggcctgaaaa 

Seq ID: 119 
>chzh_ch-v-028_601G>A 

1 caaaaattag ctaggtgtgg tggtatgcac ctataatctc agctacccag gaggctgagg 
61 caggagaatc acttgaacct ggaggcagag gttgcagtga gccgagacgc accattacac 
121 tccagcctgg gtgacagagt gagattccat ctcaaaaaaa aaaaaaaaaa attatgcctt 
181 tttgaagcac atacatttta taacatacaa ctgaatccct tattatatta ttagttttga 
241 tttaatgttt tcaaaccatc tcccctgata tttctgggag atgggaaaca tgttttctta 
301 cacctcttgc attccattct caactcccaa ctgtcttact gcaatgaaca cttaataaga 
361 aacagtcaat tggtcaattg attgggcaac aggctaaaca cactcattcc ttgtctgttc 
421 ccacttcttt ctttactttc ccttcctgag taacttatcc taaagtcatt aggtgggtgg 
481 cagccagatg gtggccacac attaaggtag aaaagagagt gtcatgatgg ttccaagtca 
541 gagacctagt agggtgagga tcaagtaggt gttcacgtgg agaaacagcc cggcctgtgt 
601 gtgggagtcc aagcaagcag agaaaatgtc gacacagagg ggtggcctga aaaagcagcc 
661 agagcctaaa cagggcatgg agaacatatt tagggcatga ggtgaggagg gcatcc atga 
721 gtgggaaggg atgggtg agg tttcactaca taaaggggat tgatgaaata agtaaataaa 
781 gtatactgga agccaggtgt gtcacttttg cagaaaagag tcatggattc agaaagg 

Seq ID: 120 

>chzg_ch-v-029_464T>C 

1 ttctttcttt actttccctt cctgagtaac ttatcctaaa gtcattaggt gggtggcagc 
61 cagatggtgg ccacacatta aggtagaaaa gagagtgtca tgatggttcc aagtcagaga 
121 cctagtag gg tgaggatcaa gtaggtgttc acgtggagaa acagcccggc ctgtgtgtgg 
181 gagtccaagc aagcagagaa aatgtcgaca cagaggggtg gcctgaaaaa gcagccagag 
241 cctaaacagg gcatggagaa catatttagg gcatgaggtg aggagggcat ccatgagtgg 
301 gaagggatgg gtgaggtttc actacataaa ggggattgat gaaataagta aataaagtat 
361 actggaagcc aggtgtgtca cttttgcaga aaagagtcat ggattcagaa agggagaaaa 
421 ctagcaggaa tcctatgaaa ttagattaaa atggatgtat ccacgtatat tcataccctt 
481 ctagatagat aaatggttag ataggtgata aaaagataac aagaggacaa gataattaga 
541 tagacataaa tgtatgtatg tgtttgtgtg tgtgtacaaa aaaacatata ctccctactt 
601 ctctccactg atagggctag gtaacaatgg catttcaata gcaatgagca cacttagtgg 
661 ccagatcttg gcttattaat accattttcc actgaaagga accagagctt tttagagaaa 
721 tggctgattc cagggccagg attaagaatg ttcaagataa gcctaggata cattttgtgc 
781 caggaagcaa gaagatgttc aaatgatttc caagtaatgt ttggaaatga tatttgaaaa 
841 tgatttccaa atgatatttc caaatgattt ccaaatgata tatggaaaca cttaaagact 
901 ccactaaaga actattagat ctgataaaca aattcagtaa tgttgctgga tacaaaatca 
961 acatacaaaa accagtagca tttctgcatg ccaacagtga acaatctggc aaaaataaaa 
1021 aatgtaatcc catttacaat aaccccaaat aaaactaaat ac ctgggaat taacttaaga 
1081 gaaagatgtc tacaatt aat attgtaaaac actgatgaag gaaattgaag aagacacaaa 
1141 aaagaaggat attccatgtt tatatattgt aagcattaat attgttaaaa atgtcca 

Seq ID: 121 

>chza_ch-v-034_328T>C 

1 caaagcaaaa atggacaaat ggatcagatc aagttaaaaa gcttctgtac cacaaagaaa 
61 gcaatcaaca aagtgaagac acaaaccaca gaatgggaga aaatattttc aaagtcacac 
121 tctgacaaca g attaatagc cagaatacat gaagcgctca aacaactctg taaggaaaaa 
181 tctaataatc caatcaaaaa atgggcaaaa tttgaataga catttttcaa aagaagacat 
241 acaaatgcca cataggcata tgataaggtg ctcaacatca ctggtcatta gagaaatgca 
301 aatcaaaacc acaatgagat atcatctcac cccagctaaa atggttttta tccaaaagac 
361 aggcaacaac aaatgccagc gagaatgtgg agaaaaggga acccttgtac actgttggtg 
421 taaattagtg caaccactat agagaacaat ttggaggttc ctcaaaacat taaaattaac 
481 attaaataga gctaccacaa tatccagaaa tccccatgct gggtatatac ctggaagaaa 
541 ggaaatcata tattgaagag ataacatcac tccaatattc acaatagcca ctattcacaa 
601 atgccaagat ttggaagcaa cctaagtgtc catcaacaga tgaatggata aagaaagtac 
661 tccaattata cacaatggag cacaattc ag ccatgaaaaa agcatgagat cctgtta tct 
721 gtaataatat ggatggaact ggaggtcatc atgttaagtg aaataagcca ggcacagaaa 
781 cacagatatt gcaagttctc acatacttgt gggatct 

Seq ID: 122 

>chzd_ch-v-030_683T>A 

1 cattaaaatt aacattaaat agagctacca caatatccag aaatccccat gctgggtata 
61 tacctggaag aaaggaaatc atatattgaa gagataacat cactccaata ttcacaatag 
121 ccactattca caaatgccaa gatttggaag caacctaagt gtccatcaac agatgaatgg 
181 ataaagaaag tactccaatt atacacaatg gagcacaatt cagccatgaa aaaagcatga 
241 gatcctgtta tctgtaataa tatggatgga actggaggtc atcatgttaa gtgaaataag 
301 ccaggcacag aaacacagat attgcaagtt ctcacatact tgtgggatct acaaatcaaa 
361 acaactgagc taatgtctgg gccttagtca gtgttgtacc caagtactgg gagcacagct 
421 tttaaaatac atcatgaatg ctttaataca ggaatgaata gatgagaggc acaaactggt 
481 tgggtgttct tctgatacac agtatcttcc ttgacagatt cagtacaact ctcaacaggt 
541 aagtctcttc atgttatgtt accttatgag gaattaagtg gcagaacatg atttctatta 
601 ttttcctttg cagaacaaga ccaactttat tagttgggac acagtgtggc tgcatttgag 
661 tcccaagcaa ccattagtct atagctatca ccacagagtc agaggggatg agacgcccag 
721 caatctcacc caagacaact ccaccaacat tcctggttac ccaccatgtg tacagtaccc 
781 tgctaggaac cagggtcatg aaagtaaata ataccagact gtgcccttga ggagctcacc 
841 tctgctaagg gaaacaggca tagaaactta caatggtggt agagagaaaa gaggacaata 
901 ggactgtgtg agggggatag gaggcaccca gaggaggaaa tggttacatt tgtgtgagga 
961 ggttggtaag gaaaaatttt agcagaaggg gtctgtctgg ct gggcttgg aaggatacgt 
1021 aggagt catc tagagggcac aggtacactc caggcagagg gaatttcgtg ggtaaagatg 
1081 tgtaggtgtg gcttgtgagg atggatttca attattctag aatgaa 

Seq ID: 123 
>chzy_ch-v-002_159G>A 

1 gattaagctt ttcatgattc ctcatagaac atgaactcaa aagaggtcag caaaggggtg 
61 tgtgcgattc tttgctattg gctgcagcta tagccctgcc tccttctcca gcacataaat 
121 ctttcagcag cttggctgaa gactgctgtg cagggcaggg aagctccagg caaacagccc 
181 agcaaacagc agcactcagc taaaaggaag actcacagaa cacagttgaa gaaggaaagt 
241 ggcgATGGAC CTCATCCCAA ATTTGGCGGT GGAAACCTGG CTTCTCCTGG CTGTCAGCCT 
301 GGTGCTCCTC TATCTgtgag taactgtcca aactcctctc tttgtttcct tggacttggg 
361 gtgctaatcg ggcccctttt cccttatctg ttttgaagat caaaagagat gttcaaggag 
421 aagtagctga agtgttggac gctacaaacg catagaagtt attattatct tatgcagatc 
481 tatgaatgaa taaataagca ttt ctcccat ccaccttcta attt tggtga ctaggagggt 
541 ttagggacag catttggtag tgggaatgat ttgattagct tagatctgac gaagactaat 
601 caatgaaaac atggcagcgg caga 

Seq ID: 124 
>chzy_ch-v-003_171C>T 

1 gattaagctt ttcatgattc ctcatagaac atgaactcaa aagaggtcag caaaggggtg 
61 tgtgcgattc tttgctattg gctgcagcta tagccctgcc tccttctcca gcacataaat 
121 ctttcagcag cttggctgaa gactgctgtg cagggcaggg aagctccagg taaacagccc 
181 agcaaacagc agcactcagc taaaaggaag actcacagaa cacagttgaa gaaggaaagt 
241 ggcgATGGAC CTCATCCCAA ATTTGGCGGT GGAAACCTGG CTTCTCCTGG CTGTCAGCCT 
301 GGTGCTCCTC TATCTgtgag taactgtcca aactcctctc tttgtttcct tggacttggg 
361 gtgctaatcg ggcccctttt cccttatctg ttttgaagat caaaagagat gttcaaggag 
421 aagtagctga agtgttggac gctacaaacg catagaagtt attattatct tatgcagatc 
481 tatgaatgaa taaataagca ttt ctcccat ccaccttcta attt tggtga ctaggagggt 
541 ttagggacag catttggtag tgggaatgat ttgattagct tagatctgac gaagactaat 
601 caatgaaaac atggcagcgg caga 

Seq ID: 125 
>chzy_ch-v-004_418-420delGAG 

1 gattaagctt ttcatgattc ctcatagaac atgaactcaa aagaggtcag caaaggggtg 
61 tgtgcgattc tttgctattg gctgcagcta tagccctgcc tccttctcca gcacataaat 
121 ctttcagcag cttggctgaa gactgctgtg cagggcaggg aagctccagg caaacagccc 
181 agcaaacagc agcactcagc taaaaggaag actcacagaa cacagttgaa gaaggaaagt 
241 ggcgATGGAC CTCATCCCAA ATTTGGCGGT GGAAACCTGG CTTCTCCTGG CTGTCAGCCT 
301 GGTGCTCCTC TATCTgtgag taactgtcca aactcctctc tttgtttcct tggacttggg 
361 gtgctaatcg ggcccctttt cccttatctg ttttgaagat caaaagagat gttcaag || aag 
421 tagctgaagt gttggacgct acaaacgcat agaagttatt attatcttat gcagatctat 
481 gaatgaataa ataagcattt ctcccatcca ccttctaatt t tggtgacta ggagggttta 
541 gggacagcat ttggtagtgg gaatgatttg attagcttag atctgacgaa gactaatcaa 
601 tgaaaacatg gcagcggcag a 


Seq ID: 126 

>chzx_ch-v-005_187C>T_ch-v-006_191-192insG

1 aggaaaggac ctgatgagtg aatgcaatta ctgatgttgg agttgctgtt attattbatc
61 gtgtacatat tacctccctc tcttgaccat tccagttcct gagtaactca ccagccctct
121 gatctataaa gtcacaatcc ctgtgacctg atttctgttt cactttgtag ATATGGGACC
181 CGTACA[t]ATG G[g]ACTTTTTA AGAGACTGGG AATTCCAGGG CCCACACCTC TGCCTTTGTT
241 GGGAAATGTT TTGTCCTATC GTCAGgtgag ttgcttgagc ttcctctttt gcttcttatg
301 gttgcaaaca tcagcttagt tccatcagta aaaatgcccc tccttgggag ggagttctga
361 ggtttcacat tttcagaaat ggtgggactg ggtgcagtgg atcatgcctg taatctcagc
421 ctctgtgagg ccaagactgg caaattgctt gagcccagga gtttg

Seq ID: 127 

>peptide of ch-v-005_ch-v-006

1 MDLIPNLAVE TWLLLAVSLV LLYLYGTRT[Y GTF.]

Seq ID: 128 
>chzw_ch-v-007_143C>T

1 catagacaag ggtgagtcct tcagtactta gagaaaattc aagagtgact ttaaattccc
61 cacttcaaat atattctctg ttttcttgtc tttcccttaa gacatctctg aatagcttcc
121 ttcaactgcc agtgaaagat ag[t]aggcctg atttcattgg acgcaactgt tttcagcccc
181 aattagaggt agggtttatt ctatttaaaa taataatcaa cttgtatttt gtttcctctc
241 ccagGGTCTC TGGAAATTTG ACACAGAGTG CTATAAAAAG TATGGAAAAA TGTGGGGgtg
301 agtattctga aaacctccat tggatagacc tgctactgtg aggaggttac cccactgcag
361 gatagtctct gcccaggtct tcatgggatg aagctcttgt caacctaaat acaaacagag
421 agaggttctc tgaaagaaga ggataattac ttgggagtag aatattgcaa tgggaatctg
481 cttgccgtta taaactatgt gcaaattcag ggaggtaaac aagacaaaga tgctccatag
541 aaaatatgag aagaatctca taactgtttt gagataatta ttgttagcta caaagatcaa
601 ta



Seq ID: 129 

>chzv_ch-v-008_199C>A

1 cagtatctct tccctgtttg gaccacatta cccttcatca tatgaagcct tgggtggctc
61 ctgtgtgaga ctcttgctgt gtgtcacacc ctaatgaact agaacctaag gttgctgtgt
121 gtcgtacaac taggggtatg gattacataa cataatgatc aaagtctggc ttcctgggtg
181 tggctccagc tgcagaat[a]g ggctagtgaa gtttaatcag ctccgttgtc cccacacagA
241 ACGTATGAAG GTCAACTCCC TGTGCTGGCC ATCACAGATC CCGACGTGAT CAGAACAGTG
301 CTAGTGAAAG AATGTTATTC TGTCTTCACA AATCGAAGGg taagcatcca ttttttgaaa
361 tttaaataat gattgatcca ctgattaaat ttttattttg aaaaaaacat atattcacag
421 aaggttacct aaaaaatgta caggaaggtt ccatgtactc ttcatcctgt cccgcccagt
481 ggtaacatct tgcaatcttg tatattgcaa tatatatcta gtatattcat attatcaggt
541 tggcacaaaa gttaaaatgg caaactacag gctgggcata atggctcatg cctg

Seq ID: 130
>chzv_ch-v-025_224C>T

1 cagtatctct tccctgtttg gaccacatta cccttcatca tatgaagcct tgggtggctc
61 ctgtgtgaga ctcttgctgt gtgtcacacc ctaatgaact agaacctaag gttgctgtgt
121 gtcgtacaac taggggtatg gattacataa cataatgatc aaagtctggc ttcctgggtg
181 tggctccagc tgcagaatcg ggctagtgaa gtttaatcag ctc[t]gttgtc cccacacagA
241 ACGTATGAAG GTCAACTCCC TGTGCTGGCC ATCACAGATC CCGACGTGAT CAGAACAGTG
301 CTAGTGAAAG AATGTTATTC TGTCTTCACA AATCGAAGGg taagcatcca ttttttgaaa
361 tttaaataat gattgatcca ctgattaaat ttttattttg aaaaaaacat atattcacag
421 aaggttacct aaaaaatgta caggaaggtt ccatgtactc ttcatcctgt cccgcccagt
481 ggtaacatct tgcaatcttg tatattgcaa tatatatcta gtatattcat attatcaggt
541 tggcacaaaa gttaaaatgg caaactacag gctgggcata atggctcatg cctg

Seq ID: 131
>chzv_ch-v-009_320C>A

1 cagtatctct tccctgtttg gaccacatta cccttcatca tatgaagcct tgggtggctc
61 ctgtgtgaga ctcttgctgt gtgtcacacc ctaatgaact agaacctaag gttgctgtgt
121 gtcgtacaac taggggtatg gattacataa cataatgatc aaagtctggc ttcctgggtg
181 tggctccagc tgcagaatcg ggctagtgaa gtttaatcag ctccgttgtc cccacacagA
241 ACGTATGAAG GTCAACTCCC TGTGCTGGCC ATCACAGATC CCGACGTGAT CAGAACAGTG
301 CTAGTGAAAG AATGTTATT[a] TGTCTTCACA AATCGAAGGg taagcatcca ttttttgaaa
361 tttaaataat gattgatcca ctgattaaat ttttattttg aaaaaaacat atattcacag
421 aaggttacct aaaaaatgta caggaaggtt ccatgtactc ttcatcctgt cccgcccagt
481 ggtaacatct tgcaatcttg tatattgcaa tatatatcta gtatattcat attatcaggt
541 tggcacaaaa gttaaaatgg caaactacag gctgggcata atggctcatg cctg



Seq ID: 132 

>peptide of ch-v-009_320C>A

1 MWGTYEGQLP VLAITDPDVI RTVLVKECY[Y] VFTNRRSLGP VGFMKSAISL AEDEEWKRIR



Seq ID: 133 

>chzv_ch-v-018_441-444insCTAAAAAAT

1 cagtatctct tccctgtttg gaccacatta cccttcatca tatgaagcct tgggtggctc
61 ctgtgtgaga ctcttgctgt gtgtcacacc ctaatgaact agaacctaag gttgctgtgt
121 gtcgtacaac taggggtatg gattacataa cataatgatc aaagtctggc ttcctgggtg
181 tggctccagc tgcagaatcg ggctagtgaa gtttaatcag ctccgttgtc cccacacagA
241 ACGTATGAAG GTCAACTCCC TGTGCTGGCC ATCACAGATC CCGACGTGAT CAGAACAGTG
301 CTAGTGAAAG AATGTTATTC TGTCTTCACA AATCGAAGGg taagcatcca ttttttgaaa
361 tttaaataat gattgatcca ctgattaaat ttttattttg aaaaaaacat atattcacag
421 aaggttacct aaaaaatgta c[ctaaaaaat] gaaggttcca tgtactcttc atcctgtccc
481 gcccagtggt aacatcttgc aatcttgtat attgcaatat atatctagta tattcatatt
541 atcaggttgg cacaaaagtt aaaatggcaa actacaggct gggcataatg gctcatgcct
601 g



Seq ID: 134 
>chzt_ch-v-016_145T>G

1 agcggaaaac tcaaggaggt atgaaaataa gatgagtctt aattagaaat gtaaagaatg
61 aatctgggga caggtagaaa gtaagatcac agtccgtttc caaggggtag tccactgagt
121 tcgagcttcc taaaaatggt cttt[g]atctt tatgtacaga aaagacatca caaaattcat
181 tacaaaatgt cacttactgc tccatgctgg agaaagccat atccttctgg gacttgagtc
241 tgcacattta actacaggta ctgatctgtt ttgtgcttag ATGTTCCCCA TCATTGCCCA
301 GTATGGAGAT GTATTGGTGA GAAACTTGAG GCGGGAAGCA GAGAAAGGCA AGCCTGTCAC
361 CTTGAAAGAg taagtaggag cacagccatg gggttctgag ctgtcatgag cccttccagc
421 tgcctgccat ggagtcgaca gtcgcactgt tgggttactc cagtgaccag acaaaagcag
481 ggcagcgctg caactccaaa gagccaccta agagggagtg gctcccatga ggcggcaagt
541 cagcaaggga aaagggcctt ctctcctgtg cacaggagcc aggatttact tatctgttaa
601 ctt

Seq ID: 135
>chzt_ch-v-019_241T>C

1 agcggaaaac tcaaggaggt atgaaaataa gatgagtctt aattagaaat gtaaagaatg
61 aatctgggga caggtagaaa gtaagatcac agtccgtttc caaggggtag tccactgagt
121 tcgagcttcc taaaaatggt cttttatctt tatgtacaga aaagacatca caaaattcat
181 tacaaaatgt cacttactgc tccatgctgg agaaagccat atccttctgg gacttgagtc
241 [c]gcacattta actacaggta ctgatctgtt ttgtgcttag ATGTTCCCCA TCATTGCCCA
301 GTATGGAGAT GTATTGGTGA GAAACTTGAG GCGGGAAGCA GAGAAAGGCA AGCCTGTCAC
361 CTTGAAAGAg taagtaggag cacagccatg gggttctgag ctgtcatgag cccttccagc
421 tgcctgccat ggagtcgaca gtcgcactgt tgggttactc cagtgaccag acaaaagcag
481 ggcagcgctg caactccaaa gagccaccta agagggagtg gctcccatga ggcggcaagt
541 cagcaaggga aaagggcctt ctctcctgtg cacaggagcc aggatttact tatctgttaa
601 ctt



Seq ID: 136
>chzq_ch-v-010_132-133insGTC

1 attggacatg atagctagat ttgtttcagg aaaacatcct gctttccaag gatttagatg
61 aatgtttttg ttcactggtg actcaggtaa cacgtcttca agaagccata gggaggttga
121 gggagggaag tc[gtc]aagaa gggaggttga ggactgcact tttgatttac ttctgacttc
181 acgagtcact ttctgccaaa gaaatctctc cttttgcttc tagCACCGAC TAGATTTCCT
241 TCAGCTGATG ATTGACTCCC AGAATTCGAA AGAAACTGAG TCCCACAAAG gtaaccaagg
301 agtgcttctg agggctactg gcggggacac taagagggag ggccttgttc tgaaaatgtg
361 caggaagtat tccaggaaga tgagaatttt tgccacatag cagaacaaca cacatttaga
421 tgttataaat ggtagctgga ggcactttcc agaagcccac aggtatagcc atgttccagg
481 ctgaaagggc aaccctaagc aaacctagaa tgcttggagg acagtcagtg gtttgtggat
541 cacctacatg agatcaaatg ccagttctca gcctcctcca gatccaccaa gtgagaacct
601 ctacttggaa atttatatca aacata

Seq ID: 137 
>chzq_ch-v-011_364G>T

1 attggacatg atagctagat ttgtttcagg aaaacatcct gctttccaag gatttagatg
61 aatgtttttg ttcactggtg actcaggtaa cacgtcttca agaagccata gggaggttga
121 gggagggaag tcaagaaggg aggttgagga ctgcactttt gatttacttc tgacttcacg
181 agtcactttc tgccaaagaa atctctcctt ttgcttctag CACCGACTAG ATTTCCTTCA
241 GCTGATGATT GACTCCCAGA ATTCGAAAGA AACTGAGTCC CACAAAGgta accaaggagt
301 gcttctgagg gctactggcg gggacactaa gagggagggc cttgttctga aaatgtgcag
361 gaa[t]tattcc aggaagatga gaatttttgc cacatagcag aacaacacac atttagatgt
421 tataaatggt agctggaggc actttccaga agcccacagg tatagccatg ttccaggctg
481 aaagggcaac cctaagcaaa cctagaatgc ttggaggaca gtcagtggtt tgtggatcac
541 ctacatgaga tcaaatgcca gttctcagcc tcctccagat ccaccaagtg agaacctcta
601 cttggaaatt tatatcaaac ata

Seq ID: 138 
>chzp_ch-v-012_269G>A

1 agataaagta cttttaggat cattcaaggc acacacccat aacactgagt atgtaagaca
61 gaaatgctct ctctggaaat tacagcagtg ctggtgctgg gatgccatga tgaggagtgt
121 gtggcccaca atcatgtaga ccttgggaaa acctggatta aaatgatttt gcgtcatcct
181 ggccctgtat aagatacata tcagaatgaa aaccactccc agtgtgactt tgaattgctt
241 ttccattttt tcttcttggg attagagagc ttcacttaga tttcatctaa gctgtgatgt
301 tgtacgttga cctgatttac ctaaaatgtc tttcctctcc tttcagCTCT GTCTGATCTG
361 GAGCTCGCAG CCCAGTCAAT AATCTTCATT TTTGCTGGCT ATGAAACCAC CAGCAGTGTT
421 CTTTCCTTCA CTTTATATGA ACTGGCCACT CACCCTGATG TCCAGCAGAA ACTGCAAAAG
481 GAGATTGATG CAGTTTTGCC CAATAAGgtg aggggatgac ccctggagat gaagggaaga
541 ggtgaagcct tagcaaaaat gcctcctcac cactccccag gagaattttt ataaaaagca
601 taatcactga ttccttcact gacataatgt aggaagcctc tgaggagaaa aacaaaggga
661 gaaacataga gaacggttgc tactggcaga agcataagat ctttgtacaa tattgctggc
721 cctggttcac ctgtttactg ttatcacaat aatgctaagt aaaaaaaaaa aaaaaaaaaa
781 aaaaaaaaaa aggagtgtgg cgagaagatg gccaaacagg aacagc



Seq ID: 139 
>chzm_ch-v-013_167A>G

1 gtccctgggg tgaggatggt cttgaatatc tcctacattc ataactcctc cacacatctc
61 agtaggtcac tgagcacatc aatggacatg ccagttatta aaatacttca cgaatactat
121 gatcatttac cagtatgagt tattctctgg agcttctaat acttca[g]tag tactgcatgg
181 actcagttga gagttaattc aaaatctcag attatccaat tctgtttctt tccttccagG
241 CACCACCTAC CTATGATGCC GTGGTACAGA TGGAGTACCT TGACATGGTG GTGAATGAAA
301 CACTCAGATT ATTCCCAGTT GCTATTAGAC TTGAGAGGAC TTGCAAGAAA GATGTTGAAA
361 TCAATGGGGT ATTCATTCCC AAAGGGTCAA TGGTGGTGAT TCCAACTTAT GCTCTTCACC
421 ATGACCCAAA GTACTGGACA GAGCCTGAGG AGTTCCGCCC TGAAAGgtac aagtctccag
481 ggaaatggag ctcaccctga cccaggctgg ttcaagcata ttctgcctct cttaatctac
541 atgacaatcg tgtggttgta caatcatttg cttgtaagtc tttttatcac aaaaaagtga
601 taattatcaa actttacaaa ccacagacta gaaaaaacga aactacatcc atccacagtc
661 ccagcacaag acaaagataa tcaattatgt ccctgtgggc atttttctac gcctatatag
721 atttttaaaa attagaatgg tatcactttt tatttggttt gaattgctgc ttacttgatt
781 taacaggaaa ctatc

Seq ID: 140
>chzm_ch-v-017_248-249insT

1 gtccctgggg tgaggatggt cttgaatatc tcctacattc ataactcctc cacacatctc
61 agtaggtcac tgagcacatc aatggacatg ccagttatta aaatacttca cgaatactat
121 gatcatttac cagtatgagt tattctctgg agcttctaat acttcaatag tactgcatgg
181 actcagttga gagttaattc aaaatctcag attatccaat tctgtttctt tccttccagG
241 CACCACCT[t]A CCTATGATGC CGTGGTACAG ATGGAGTACC TTGACATGGT GGTGAATGAA
301 ACACTCAGAT TATTCCCAGT TGCTATTAGA CTTGAGAGGA CTTGCAAGAA AGATGTTGAA
361 ATCAATGGGG TATTCATTCC CAAAGGGTCA ATGGTGGTGA TTCCAACTTA TGCTCTTCAC
421 CATGACCCAA AGTACTGGAC AGAGCCTGAG GAGTTCCGCC CTGAAAGgta caagtctcca
481 gggaaatgga gctcaccctg acccaggctg gttcaagcat attctgcctc tcttaatcta
541 catgacaatc gtgtggttgt acaatcattt gcttgtaagt ctttttatca caaaaaagtg
601 ataattatca aactttacaa accacagact agaaaaaacg aaactacatc catccacagt
661 cccagcacaa gacaaagata atcaattatg tccctgtggg catttttcta cgcctatata
721 gatttttaaa aattagaatg gtatcacttt ttatttggtt tgaattgctg cttacttgat
781 ttaacaggaa actatc



Seq ID: 141 

>peptide of ch-v-017_248-249insT

1 ALSDLELAAQ SIIFIFAGYE TTSSVLSFTL YELATHPDVQ QKLQKEIDAV LPNKAPP[YL.]



Seq ID: 142 
>chzm_ch-v-014_643C>T

1 gtccctgggg tgaggatggt cttgaatatc tcctacattc ataactcctc cacacatctc
61 agtaggtcac tgagcacatc aatggacatg ccagttatta aaatacttca cgaatactat
121 gatcatttac cagtatgagt tattctctgg agcttctaat acttcaatag tactgcatgg
181 actcagttga gagttaattc aaaatctcag attatccaat tctgtttctt tccttccagG
241 CACCACCTAC CTATGATGCC GTGGTACAGA TGGAGTACCT TGACATGGTG GTGAATGAAA
301 CACTCAGATT ATTCCCAGTT GCTATTAGAC TTGAGAGGAC TTGCAAGAAA GATGTTGAAA
361 TCAATGGGGT ATTCATTCCC AAAGGGTCAA TGGTGGTGAT TCCAACTTAT GCTCTTCACC
421 ATGACCCAAA GTACTGGACA GAGCCTGAGG AGTTCCGCCC TGAAAGgtac aagtctccag
481 ggaaatggag ctcaccctga cccaggctgg ttcaagcata ttctgcctct cttaatctac
541 atgacaatcg tgtggttgta caatcatttg cttgtaagtc tttttatcac aaaaaagtga
601 taattatcaa actttacaaa ccacagacta gaaaaaacga aa[t]tacatcc atccacagtc
661 ccagcacaag acaaagataa tcaattatgt ccctgtgggc atttttctac gcctatatag
721 atttttaaaa attagaatgg tatcactttt tatttggttt gaattgctgc ttacttgatt
781 taacaggaaa ctatc

Seq ID: 143
>chzn_ch-v-015_351T>C

1 caggcctggc acagagtcag tgctccataa atattttgtt aaacgatgga tggtgagtgc
61 ttttactatc cagtatttac ccagcttata gattaagtat gaagagttca agatacabgg
121 tgttaagagt cgtttttata tgcttgcaaa gcatttttgt catatttttt ctactttgct
181 tccatctttt cttctttcac ttcatttatt aattctccat atgcttgttt aactattgta
241 gATCCCCTTG AAATTAGACA CGCAAGGACT TCTTCAACCA GAAAAACCCA TTGTTCTAAA
301 GGTGGATTCA AGAGATGGAA CCCTAAGTGG AGAATGAgtt attctaagga [c]ttctacttt
361 ggtcttcaag aaagctgtgc cccagaacac cagagatttc aacttagtca ataaaacctt
421 gaaataaaga tgggcttaat ctaatgtact gcatgagtag ttggtgattt tgtacattca
481 ttgagctctc ccagagtctg tgtagagtgt tgtgcattat gtagtataaa ggaggtgacc
541 aggtaagtga cagataggta gactcagctt ctctgcttct cataggacta cctctaccca
601 cctctagtta gcatta



Seq ID: 145 

>peptide of ch-v-001_406C>A

1 PVAIRLERTC KKDVEINGVF IPKGSMVVIP gYALHHDPKY WTEPEEFRPE RFSKKKDSID



Seq ID: 206 

>chyu_ch-v-048_206G>A

1 catttagtcc ttgtgagcac ttgatgattt acctgccttc aatttttcac tgacctaata
61 ttctttttga taatgaagta ttttaaacat ataaaacatt atggagagtg gcataggaga
121 tacccacgta tgtaccaccc agcttaacga atgctctact gtcatttcta accataatct
181 ctttaaagag ctcttttgtc tttca[a]tatc tcttccctgt ttggaccaca ttacccttca
241 tcatatgaag ccttgggtgg ctcctgtgtg agactcttgc tgtgtgtcac accctaatga
301 actagaacct aaggttgctg tgtgtcgtac aactaggggt atggattaca taacataatg
361 atcaaagtct ggcttcctgg gtgtggctcc agctgcagaa tcgggctagt gaagtttaat
421 cagctccgtt gtccccacac agAA



Seq ID: 207 
>chzk_ch-v-037_230T>C

1 tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata
61 ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa
121 gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag
181 gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaa[c] agaaggggtc
241 tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag
301 gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt
361 attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc
421 atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa
481 aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg
541 ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat
601 aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca
661 gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc
721 aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca
781 gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt

Seq ID: 208
>chzk_ch-v-038_506C>T

1 tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata
61 ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa
121 gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag
181 gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc
241 tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag
301 gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt
361 attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc
421 atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa
481 aactgattaa gttgtttatg attcc[t]cata gaatatgaac tcaaaggagg taagcaaagg
541 ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat
601 aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca
661 gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc
721 aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca
781 gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt

Seq ID: 209 
>chzk_ch-v-039_514T>C

1 tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata
61 ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa
121 gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag
181 gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc
241 tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag
301 gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt
361 attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc
421 atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa
481 aactgattaa gttgtttatg attccccata gaa[c]atgaac tcaaaggagg taagcaaagg
541 ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat
601 aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca
661 gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc
721 aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca
781 gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt

Seq ID: 210 

>chzh_ch-v-051_455T>G

1 caaaaattag ctaggtgtgg tggtatgcac ctataatctc agctacccag gaggctgagg
61 caggagaatc acttgaacct ggaggcagag gttgcagtga gccgagacgc accattacac
121 tccagcctgg gtgacagagt gagattccat ctcaaaaaaa aaaaaaaaaa attatgcctt
181 tttgaagcac atacatttta taacatacaa ctgaatccct tattatatta ttagttttga
241 tttaatgttt tcaaaccatc tcccctgata tttctgggag atgggaaaca tgttttctta
301 cacctcttgc attccattct caactcccaa ctgtcttact gcaatgaaca cttaataaga
361 aacagtcaat tggtcaattg attgggcaac aggctaaaca cactcattcc ttgtctgttc
421 ccacttcttt ctttactttc ccttcctgag taac[g]tatcc taaagtcatt aggtgggtgg
481 cagccagatg gtggccacac attaaggtag aaaagagagt gtcatgatgg ttccaagtca
541 gagacctagt agggtgagga tcaagtaggt gttcacgtgg agaaacagcc cggcctgtgt
601 gtgggagtcc aagcaagcag agaaaatgtc gacacagagg ggtggcctga aaaagcagcc
661 agagcctaaa cagggcatgg agaacatatt tagggcatga ggtgaggagg gcatccatga
721 gtgggaaggg atgggtgagg tttcactaca taaaggggat tgatgaaata agtaaataaa
781 gtatactgga agccaggtgt gtcacttttg cagaaaagag tcatggattc agaaagg

Seq ID: 211
>chzh_ch-v-052_577G>A

1 caaaaattag ctaggtgtgg tggtatgcac ctataatctc agctacccag gaggctgagg
61 caggagaatc acttgaacct ggaggcagag gttgcagtga gccgagacgc accattacac
121 tccagcctgg gtgacagagt gagattccat ctcaaaaaaa aaaaaaaaaa attatgcctt
181 tttgaagcac atacatttta taacatacaa ctgaatccct tattatatta ttagttttga
241 tttaatgttt tcaaaccatc tcccctgata tttctgggag atgggaaaca tgttttctta
301 cacctcttgc attccattct caactcccaa ctgtcttact gcaatgaaca cttaataaga
361 aacagtcaat tggtcaattg attgggcaac aggctaaaca cactcattcc ttgtctgttc
421 ccacttcttt ctttactttc ccttcctgag taacttatcc taaagtcatt aggtgggtgg
481 cagccagatg gtggccacac attaaggtag aaaagagagt gtcatgatgg ttccaagtca
541 gagacctagt agggtgagga tcaagtaggt gttcac[a]tgg agaaacagcc cggcctgtgt
601 gtgggagtcc aagcaagcag agaaaatgtc gacacagagg ggtggcctga aaaagcagcc
661 agagcctaaa cagggcatgg agaacatatt tagggcatga ggtgaggagg gcatccatga
721 gtgggaaggg atgggtgagg tttcactaca taaaggggat tgatgaaata agtaaataaa
781 gtatactgga agccaggtgt gtcacttttg cagaaaagag tcatggattc agaaagg

Seq ID: 212
>chzw_ch-v-053_163G>A

1 catagacaag ggtgagtcct tcagtactta gagaaaattc aagagtgact ttaaattccc
61 cacttcaaat atattctctg ttttcttgtc tttcccttaa gacatctctg aatagcttcc
121 ttcaactgcc agtgaaagat agcaggcctg atttcattgg ac[a]caactgt tttcagcccc
181 aattagaggt agggtttatt ctatttaaaa taataatcaa cttgtatttt gtttcctctc
241 ccagGGTCTC TGGAAATTTG ACACAGAGTG CTATAAAAAG TATGGAAAAA TGTGGGGgtg
301 agtattctga aaacctccat tggatagacc tgctactgtg aggaggttac cccactgcag
361 gatagtctct gcccaggtct tcatgggatg aagctcttgt caacctaaat acaaacagag
421 agaggttctc tgaaagaaga ggataattac ttgggagtag aatattgcaa tgggaatctg
481 cttgccgtta taaactatgt gcaaattcag ggaggtaaac aagacaaaga tgctccatag
541 aaaatatgag aagaatctca taactgtttt gagataatta ttgttagcta caaagatcaa
601 ta

Seq ID: 213
>chzw_ch-v-054_444T>A

1 catagacaag ggtgagtcct tcagtactta gagaaaattc aagagtgact ttaaattccc
61 cacttcaaat atattctctg ttttcttgtc tttcccttaa gacatctctg aatagcttcc
121 ttcaactgcc agtgaaagat agcaggcctg atttcattgg acgcaactgt tttcagcccc
181 aattagaggt agggtttatt ctatttaaaa taataatcaa cttgtatttt gtttcctctc
241 ccagGGTCTC TGGAAATTTG ACACAGAGTG CTATAAAAAG TATGGAAAAA TGTGGGGgtg
301 agtattctga aaacctccat tggatagacc tgctactgtg aggaggttac cccactgcag
361 gatagtctct gcccaggtct tcatgggatg aagctcttgt caacctaaat acaaacagag
421 agaggttctc tgaaagaaga gga[a]aattac ttgggagtag aatattgcaa tgggaatctg
481 cttgccgtta taaactatgt gcaaattcag ggaggtaaac aagacaaaga tgctccatag
541 aaaatatgag aagaatctca taactgtttt gagataatta ttgttagcta caaagatcaa
601 ta

Seq ID: 214
>chzt_ch-v-043_294T>C

1 agcggaaaac tcaaggaggt atgaaaataa gatgagtctt aattagaaat gtaaagaatg
61 aatctgggga caggtagaaa gtaagatcac agtccgtttc caaggggtag tccactgagt
121 tcgagcttcc taaaaatggt cttttatctt tatgtacaga aaagacatca caaaattcat
181 tacaaaatgt cacttactgc tccatgctgg agaaagccat atccttctgg gacttgagtc
241 tgcacattta actacaggta ctgatctgtt ttgtgcttag ATGTTCCCCA TCA[c]TGCCCA
301 GTATGGAGAT GTATTGGTGA GAAACTTGAG GCGGGAAGCA GAGAAAGGCA AGCCTGTCAC
361 CTTGAAAGAg taagtaggag cacagccatg gggttctgag ctgtcatgag cccttccagc
421 tgcctgccat ggagtcgaca gtcgcactgt tgggttactc cagtgaccag acaaaagcag
481 ggcagcgctg caactccaaa gagccaccta agagggagtg gctcccatga ggcggcaagt
541 cagcaaggga aaagggcctt ctctcctgtg cacaggagcc aggatttact tatctgttaa
601 ctt



Seq ID: 215 

>peptide of ch-v-043_294T>C

1 SLAEDEEWKR IRSLLSPTFT SGKLKEMFPI [T]AQYGDVLVR NLRREAEKGK PVTLKDIFGA



Seq ID: 216 
>chzt_ch-v-055_444G>A

1 agcggaaaac tcaaggaggt atgaaaataa gatgagtctt aattagaaat gtaaagaatg
61 aatctgggga caggtagaaa gtaagatcac agtccgtttc caaggggtag tccactgagt
121 tcgagcttcc taaaaatggt cttttatctt tatgtacaga aaagacatca caaaattcat
181 tacaaaatgt cacttactgc tccatgctgg agaaagccat atccttctgg gacttgagtc
241 tgcacattta actacaggta ctgatctgtt ttgtgcttag ATGTTCCCCA TCATTGCCCA
301 GTATGGAGAT GTATTGGTGA GAAACTTGAG GCGGGAAGCA GAGAAAGGCA AGCCTGTCAC
361 CTTGAAAGAg taagtaggag cacagccatg gggttctgag ctgtcatgag cccttccagc
421 tgcctgccat ggagtcgaca gtc[a]cactgt tgggttactc cagtgaccag acaaaagcag
481 ggcagcgctg caactccaaa gagccaccta agagggagtg gctcccatga ggcggcaagt
541 cagcaaggga aaagggcctt ctctcctgtg cacaggagcc aggatttact tatctgttaa
601 ctt

Seq ID: 217
>chzs_ch-v-050_437G>A

1 cagagcttac atatcttata tcatccacac tcaacacatg ctactgtagt tgtctgataa
61 tgggtctctg tcttcctatg actgggctcc ttgacctcag aggtgagtct aactcagctt
121 ggtgtctcca tcacccccag catagggcca gctccatcac tggcaccaga taaccacctt
181 ctgagggagt agatggaaga tgattcagca gatagttctg aaagtctgtg gctctttatg
241 tgtcttgact ggatatgtgg gtttcttgct gcatgtatag tggaaggacg gtaagaggtg
301 ctgattttaa ttttccatat ctttctccac tcagCATCTT TGGGGCCTAC AGCATGGATG
361 TGATTACTGG CACATCATTT GGAGTGAACA TCGACTCTCT CAACAATCCA CAAGACCCCT
421 TTGTGGAGAG CACTAA[a]AAG TTCCTAAAAT TTGGTTTCTT AGATCCATTA TTTCTCTCAA
481 TAAgtatgtg ggctattatt tctttctctc tttttaaaaa taactgcttt cttgacatat
541 aattcacata tcgtataatt catccactta aaaggtacaa ttccattgtt tttaagataa
601 tcaaaaatat gtatgaccat tactattgta aactaaaatg tttttgtcaa tctagagccc
661 tcacacactt tagctgtcaa caccccacca caaaccccac tgccctaagc atccaataat
721 caactttctg cctctataga tttgcctatt ctggacactt catagaaata atatcatt

Seq ID: 218
>chzs_ch-v-056_467A>G

1 cagagcttac atatcttata tcatccacac tcaacacatg ctactgtagt tgtctgataa
61 tgggtctctg tcttcctatg actgggctcc ttgacctcag aggtgagtct aactcagctt
121 ggtgtctcca tcacccccag catagggcca gctccatcac tggcaccaga taaccacctt
181 ctgagggagt agatggaaga tgattcagca gatagttctg aaagtctgtg gctctttatg
241 tgtcttgact ggatatgtgg gtttcttgct gcatgtatag tggaaggacg gtaagaggtg
301 ctgattttaa ttttccatat ctttctccac tcagCATCTT TGGGGCCTAC AGCATGGATG
361 TGATTACTGG CACATCATTT GGAGTGAACA TCGACTCTCT CAACAATCCA CAAGACCCCT
421 TTGTGGAGAG CACTAAGAAG TTCCTAAAAT TTGGTTTCTT AGATCC[g]TTA TTTCTCTCAA
481 TAAgtatgtg ggctattatt tctttctctc tttttaaaaa taactgcttt cttgacatat
541 aattcacata tcgtataatt catccactta aaaggtacaa ttccattgtt tttaagataa
601 tcaaaaatat gtatgaccat tactattgta aactaaaatg tttttgtcaa tctagagccc
661 tcacacactt tagctgtcaa caccccacca caaaccccac tgccctaagc atccaataat
721 caactttctg cctctataga tttgcctatt ctggacactt catagaaata atatcatt

Seq ID: 219
>chzs_ch-v-057_583C>T

1 cagagcttac atatcttata tcatccacac tcaacacatg ctactgtagt tgtctgataa
61 tgggtctctg tcttcctatg actgggctcc ttgacctcag aggtgagtct aactcagctt
121 ggtgtctcca tcacccccag catagggcca gctccatcac tggcaccaga taaccacctt
181 ctgagggagt agatggaaga tgattcagca gatagttctg aaagtctgtg gctctttatg
241 tgtcttgact ggatatgtgg gtttcttgct gcatgtatag tggaaggacg gtaagaggtg
301 ctgattttaa ttttccatat ctttctccac tcagCATCTT TGGGGCCTAC AGCATGGATG
361 TGATTACTGG CACATCATTT GGAGTGAACA TCGACTCTCT CAACAATCCA CAAGACCCCT
421 TTGTGGAGAG CACTAAGAAG TTCCTAAAAT TTGGTTTCTT AGATCCATTA TTTCTCTCAA
481 TAAgtatgtg ggctattatt tctttctctc tttttaaaaa taactgcttt cttgacatat
541 aattcacata tcgtataatt catccactta aaaggtacaa tt[t]cattgtt tttaagataa
601 tcaaaaatat gtatgaccat tactattgta aactaaaatg tttttgtcaa tctagagccc
661 tcacacactt tagctgtcaa caccccacca caaaccccac tgccctaagc atccaataat
721 caactttctg cctctataga tttgcctatt ctggacactt catagaaata atatcatt

Seq ID: 220
>chzs_ch-v-058_650A>G

1 cagagcttac atatcttata tcatccacac tcaacacatg ctactgtagt tgtctgataa
61 tgggtctctg tcttcctatg actgggctcc ttgacctcag aggtgagtct aactcagctt
121 ggtgtctcca tcacccccag catagggcca gctccatcac tggcaccaga taaccacctt
181 ctgagggagt agatggaaga tgattcagca gatagttctg aaagtctgtg gctctttatg
241 tgtcttgact ggatatgtgg gtttcttgct gcatgtatag tggaaggacg gtaagaggtg
301 ctgattttaa ttttccatat ctttctccac tcagCATCTT TGGGGCCTAC AGCATGGATG
361 TGATTACTGG CACATCATTT GGAGTGAACA TCGACTCTCT CAACAATCCA CAAGACCCCT
421 TTGTGGAGAG CACTAAGAAG TTCCTAAAAT TTGGTTTCTT AGATCCATTA TTTCTCTCAA
481 TAAgtatgtg ggctattatt tctttctctc tttttaaaaa taactgcttt cttgacatat
541 aattcacata tcgtataatt catccactta aaaggtacaa ttccattgtt tttaagataa
601 tcaaaaatat gtatgaccat tactattgta aactaaaatg tttttgtca[g] tctagagccc
661 tcacacactt tagctgtcaa caccccacca caaaccccac tgccctaagc atccaataat
721 caactttctg cctctataga tttgcctatt ctggacactt catagaaata atatcatt

Seq ID: 221
>chzr_ch-v-059_205T>C

1 agaaggtgcc attgatctca ctgctgtagt ggtgtttcct atgtatagac ctgcccttgc
61 tcagtcgccg gcctgaaaga agggcaaaca tgataaaagg aatgggttcc agttgagaat
121 catgatgttc ttattcttat tactggtaga gaaaattata attgctccag gtaaagtttg
181 cattttcaat gatttccttt tgtt[c]gtttt gtttttccca cagTACTCTT TCCATTCCTT
241 ACCCCAGTTT TTGAAGCATT AAATGTCTCT CTGTTTCCAA AAGATACCAT AAATTTTTTA
301 AGTAAATCTG TAAACAGAAT GAAGAAAAGT CGCCTCAACG ACAAACAAAA Ggtaaaatct
361 gatggtggtt aaatgacgat gtttaggttt tgataaattt agattttata cacatgatag
421 agcatgtatc tgtattttta aaaataaaga cagagaactt atgtttagaa caagagaagc
481 catttggtag aaataaagaa ggagattggg gaaggagatg agaatgagtc agagagatag
541 catttaaaac ttgaaatcag gcacaacaat tagtatgtca tgatataaac agtattgaga
601 taaaatttta ccacttctct tccctttaat aaattgtcaa aggataaagt ttcctgtttg
661 aaaatatatt

Seq ID: 222
>chzr_ch-v-060_496A>C

1 agaaggtgcc attgatctca ctgctgtagt ggtgtttcct atgtatagac ctgcccttgc
61 tcagtcgccg gcctgaaaga agggcaaaca tgataaaagg aatgggttcc agttgagaat
121 catgatgttc ttattcttat tactggtaga gaaaattata attgctccag gtaaagtttg
181 cattttcaat gatttccttt tgtttgtttt gtttttccca cagTACTCTT TCCATTCCTT
241 ACCCCAGTTT TTGAAGCATT AAATGTCTCT CTGTTTCCAA AAGATACCAT AAATTTTTTA
301 AGTAAATCTG TAAACAGAAT GAAGAAAAGT CGCCTCAACG ACAAACAAAA Ggtaaaatct
361 gatggtggtt aaatgacgat gtttaggttt tgataaattt agattttata cacatgatag
421 agcatgtatc tgtattttta aaaataaaga cagagaactt atgtttagaa caagagaagc
481 catttggtag aaata[c]agaa ggagattggg gaaggagatg agaatgagtc agagagatag
541 catttaaaac ttgaaatcag gcacaacaat tagtatgtca tgatataaac agtattgaga
601 taaaatttta ccacttctct tccctttaat aaattgtcaa aggataaagt ttcctgtttg
661 aaaatatatt



Seq ID: 223 
>chzp_ch-v-062_173G>A

1 agataaagta cttttaggat cattcaaggc acacacccat aacactgagt atgtaagaca
61 gaaatgctct ctctggaaat tacagcagtg ctggtgctgg gatgccatga tgaggagtgt
121 gtggcccaca atcatgtaga ccttgggaaa acctggatta aaatgatttt gc[a]tcatcct
181 ggccctgtat aagatacata tcagaatgaa aaccactccc agtgtgactt tgaattgctt
241 ttccattttt tcttcttggg attagagagc ttcacttaga tttcatctaa gctgtgatgt
301 tgtacgttga cctgatttac ctaaaatgtc tttcctctcc tttcagCTCT GTCTGATCTG
361 GAGCTCGCAG CCCAGTCAAT AATCTTCATT TTTGCTGGCT ATGAAACCAC CAGCAGTGTT
421 CTTTCCTTCA CTTTATATGA ACTGGCCACT CACCCTGATG TCCAGCAGAA ACTGCAAAAG
481 GAGATTGATG CAGTTTTGCC CAATAAGgtg aggggatgac ccctggagat gaagggaaga
541 ggtgaagcct tagcaaaaat gcctcctcac cactccccag gagaattttt ataaaaagca
601 taatcactga ttccttcact gacataatgt aggaagcctc tgaggagaaa aacaaaggga
661 gaaacataga gaacggttgc tactggcaga agcataagat ctttgtacaa tattgctggc
721 cctggttcac ctgtttactg ttatcacaat aatgctaagt aaaaaaaaaa aaaaaaaaaa
781 aaaaaaaaaa aggagtgtgg cgagaagatg gccaaacagg aacagc

Seq ID: 224
>chzp_ch-v-063_312C>T

1 agataaagta cttttaggat cattcaaggc acacacccat aacactgagt atgtaagaca
61 gaaatgctct ctctggaaat tacagcagtg ctggtgctgg gatgccatga tgaggagtgt
121 gtggcccaca atcatgtaga ccttgggaaa acctggatta aaatgatttt gcgtcatcct
181 ggccctgtat aagatacata tcagaatgaa aaccactccc agtgtgactt tgaattgctt
241 ttccattttt tcttcttggg attagagagc ttcacttaga tttcatctaa gctgtgatgt
301 tgtacgttga cctgatttac ctaaaatgtc tttcctctcc tttcagCTCT GTCTGATCTG
361 GAGCTCGCAG CCCAGTCAAT AATCTTCATT TTTGCTGGCT ATGAAACCAC CAGCAGTGTT
421 CTTTCCTTCA CTTTATATGA ACTGGCCACT CACCCTGATG TCCAGCAGAA ACTGCAAAAG
481 GAGATTGATG CAGTTTTGCC CAATAAGgtg aggggatgac ccctggagat gaagggaaga
541 ggtgaagcct tagcaaaaat gcctcctcac cactccccag gagaattttt ataaaaagca
601 taatcactga ttccttcact gacataatgt aggaagcctc tgaggagaaa aacaaaggga
661 gaaacataga gaacggttgc tactggcaga agcataagat ctttgtacaa tattgctggc
721 cctggttcac ctgtttactg ttatcacaat aatgctaagt aaaaaaaaaa aaaaaaaaaa
781 aaaaaaaaaa aggagtgtgg cgagaagatg gccaaacagg aacagc
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Seq ID: 225 
>clizn_ch-v- 0 4 4„ 
1 caggcctggc 
61 ttttactatc 
121 tgttaagagt 
181 tccatctttt 
241 gATCCCCTTG 
3 01 GGTGGATTCA 
361 ggtcttcaag 
421 gaaataaaga 
481 ttgagctctc 
541 aggtaagtga 
601 cctctagtta 

Seq ID: 226 
>chzy_ch- V- 0 6 6_ 
1 gattaagctt 
61 tgtgcgattc 
121 ctttcagcag 
181 agcaaacagc 
241 ggcgATGGAC 
3 01 GGTGCTCCTC 
3 61 gtgctaatcg 
421 aagtagctga 
481 tatgaatgaa 
541 ttagggacag 
601 caatgaaaac 

Seq ID: 227 
>ch2y_c]:i-v-0 67_ 
1 gattaagctt 
61 tgtgcgattc 
121 ctttcagcag 
181 agcaaacagc 

2 41 ggcgATGGAC 

3 01 GGTGCTCCTC 
3 61 gtgctaatcg 
421 aagtagctga 
481 tatgaatgaa 
541 ttagggacag 
601 caatgaaaac 

Seq ID: 228 
>chzu_ch-v- 0 6 8_ 
1 gaacagttcc 
61 gagaccacac 
121 gaggctaggc 
181 tacagtcaat 
241 aagatcacca 
3 01 GTGGGATTTA 
3 61 TCATTGCTGT 
421 tcttaattag 
481 tttccaaggg 
541 cagaaaagac 
601 ccatatcctt 



23 9T>C 

acagagtcag 

cagtatttac 

cgtttttata 

cttctttcac 

AAATTAGACA 

AGAGATGGAA 

aaagctgtgc 

tgggcttaat 

cca gagtctg 



tgctccataa 
ccagcttata 
tgcttgcaaa 
ttcatttatt 
CGCAAGGACT 
CCCTAAGTGG 
cccagaacac 
ctaatgtact 
tgtagagtgt 



atattttgtt 
gattaagtat 
gcatttttgt 
aattctccat 
TCTTCAACCA 
AGAATGAgtt 
cagagatttc 
gcatgagtag 
tgtgca ttat 



aaacgatgga 
gaagagttca 



cagataggta 
gcatta 



380T>C 
ttcatgattc 

tttgctattg 
cttggctgaa 
agcactcagc 
CTCATCCCAA 
TATCTgtgag 
ggcccctttjcj 
agtgttggac 
taaataagca 
catttggtag 
atggcagcgg 



474G>A 

ttcatgattc 

tttgctattg 

cttggctgaa 

agcactcagc 

CTCATCCCAA 

TATCTgtgag 

ggcccctttt 

agtgttggac 

taaataagca 

catttggtag 

atggcagcgg 



3 59G>A 

ctaccacgtg 

cagatggctg 

atctaggatc 

aaatatttgt 

caactaatgt 

TGAAAAGTGC 

CTCCAACCTT 

aaatgtaaag 

gtagtccact 

atcacaaaat 

ctgggacttg 



gactcagctt ctctgcttct 



ctcatagaac 

gctgcagcta 
gactgctgtg 
taaaaggaag 
ATTTGGCGGT 
taactgtcca 
cccttatctg 
gctacaaacg 
tttctcccat 



atgaactcaa 
tagccctgcc 
cagggcaggg 
actcacagaa 
GGAAACCTGG 
aactcctctc 
ttttgaagat 
catagaagtt, 
ccaccttcta 



catatttttt 

atgcttgttt 
GAAAAACCCA 
attctaagga 
aacttagtca 
ttggtgattt- 
gtagtataaa 
cataggacta 



aagaggtcag 
tccttctcca 



tggtgagtgc 
agatacatgg 



ctactttgct 
aactattg[c|a 
TTGTTCTAAA 
tttctacttt 
ataaaacctt 
tgtacattca 
ggaggtgacc 
cctctaccca 



caaaggggtg 
gcacataaat 



aagctccagg 
cacagttgaa 
CTTCTCCTGG 
tttgtttcct 
caaaagagat 
attattatct 
attttggtga 



tgggaatgat 
caga 



ctcatagaac 
gctgcagcta 

gactgctgtg 
taaaaggaag 
ATTTGGCGGT 
ta*actgtcca 
cccttatctg 
gctacaaacg 
tttctcccat 



ttgattagct tagatctgac 



atgaactcaa 
tagccctgcc 

cagggcaggg 
actcacagaa 
GGAAACCTGG 
aactcctctc 
ttttgaagat 
catagaagtt 
ccaccttcta 



aagaggtcag 
tccttctcca 



caaacagccc 
gaaggaaagt 
CTGTCAGCCT 
tggacttggg 
gttcaaggag 
tatgcagatc 
ctaggagggt 
gaagactaat 



caaaggggtg 
gcacataaat 



aagctccagg 
cacagttgaa 
CTTCTCCTGG 
tttgtttcct 
caaaagagat 
attattatct 
attt tggtga 



tgggaatgat 
caga 



gagcatttgc 
ggtctcccca 
tccattgagc 
taaataagga 
gagaaaaaat 
CATCTCTTTA 
CACCAGCGGA 
aatgaatctg 
gagttcgagc 



ttgattagct tagatctgac 



aattaaaagg 
ctcccacccc 
atcttgaata 
tgcctcttca 
gtttctgttg 
GCTGAGGATG 
AAACTCAAGG 
gggacaggta 
ttcctaaaaa 



agactgagat 
cgccccacat 



caaacagccc 
gaaggaaagt 
CTGTCAGCCT 
tggacttggg 
gttcaaggag 
tatgcagatc 
ctaggagggt 
gaagactaat 



atagaggcag 
acactcagaa 



tggcttgcca 
atatattttg 
aactctagTC 
AAGAATGGAA 
AGgtatgaaa 
gaaagtaaga 
tggtct ttta 



tcattacaaa 
agtctg 



atgtcactta ctgctccatg 



taatatcata 
tgcaaccatg 
TTTAGGCCCA 
GAGAATACgc 
ataagatgag 
tcacagtccg 
tctttatgta 
ctggagaaag 
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Seq ID: 229 

>peptide of ch-v-068_359G>A 

1 SVFTNRRSLG PVGFMKSAIS LAEDEEWKRI gSLLSPTFTS GKLKEMFPII AQYGDVLVRN 



Seq ID: 230
>chzu_ch-v-047_404T>C
1 gaacagttcc ctaccacgtg gagcatttgc aattaaaagg agactgagat atagaggcag
61 gagaccacac cagatggctg ggtctcccca ctcccacccc cgccccgcat acactcagaa
121 gaggctaggc atctaggatc tccattgagc atcttgaata tggcttgcca taatatcata
181 tacagtcaat aaatatttgt taaataagga tgcctcttca atatattttg tgcaaccatg
241 aagatcacca caactaatgt gagaaaaaat gtttctgttg aactctagTC TTTAGGCCCA
301 GTGGGATTTA TGAAAAGTGC CATCTCTTTA GCTGAGGATG AAGAATGGAA GAGAATACGG
361 TCATTGCTGT CTCCAACCTT CACCAGCGGA AAACTCAAGG AGg[c]atgaaa ataagatgag
421 tcttaattag aaatgtaaag aatgaatctg gggacaggta gaaagtaaga tcacagtccg
481 tttccaaggg gtagtccact gagttcgagc ttcctaaaaa tggtctttta tctttatgta
541 cagaaaagac atcacaaaat atgtcactta ctgctccatg tcattacaaa ctggagaaag
601 ccatatcctt ctgggacttg agtctg

Seq ID: 231
>chzu_ch-v-069_480G>A
1 gaacagttcc ctaccacgtg gagcatttgc aattaaaagg agactgagat atagaggcag
61 gagaccacac cagatggctg ggtctcccca ctcccacccc cgccccacat acactcagaa
121 gaggctaggc atctaggatc tccattgagc atcttgaata tggcttgcca taatatcata
181 tacagtcaat aaatatttgt taaataagga tgcctcttca atatattttg tgcaaccatg
241 aagatcacca caactaatgt gagaaaaaat gtttctgttg aactctagTC TTTAGGCCCA
301 GTGGGATTTA TGAAAAGTGC CATCTCTTTA GCTGAGGATG AAGAATGGAA GAGAATACGG
361 TCATTGCTGT CTCCAACCTT CACCAGCGGA AAACTCAAGG AGgtatgaaa ataagatgag
421 tcttaattag aaatgtaaag aatgaatctg gggacaggta gaaagtaaga tcacagtcc[a]
481 tttccaaggg gtagtccact gagttcgagc ttcctaaaaa tggtctttta tctttatgta
541 cagaaaagac atcacaaaat atgtcactta ctgctccatg tcattacaaa ctggagaaag
601 ccatatcctt ctgggacttg agtctg

Seq ID: 232
>chzq_ch-v-061_194C>G
1 attggacatg atagctagat ttgtttcagg aaaacatcct gctttccaag gatttagatg
61 aatgtttttg ttcactggtg actcaggtaa cacgtcttca agaagccata gggaggttga
121 gggagggaag tcaagaaggg aggttgagga ctgcactttt gatttacttc tgacttcacg
181 agtcactttc tgc[g]aaagaa atctctcctt ttgcttctag CACCGACTAG ATTTCCTTCA
241 GCTGATGATT GACTCCCAGA ATTCGAAAGA AACTGAGTCC CACAAAGgta accaaggagt
301 gcttctgagg gctactggcg gggacactaa gagggagggc cttgttctga aaatgtgcag
361 gaagtattcc aggaagatga gaatttttgc cacatagcag aacaacacac atttagatgt
421 tataaatggt agctggaggc actttccaga agcccacagg tatagccatg ttccaggctg
481 aaagggcaac cctaagcaaa cctagaatgc ttggaggaca gtcagtggtt tgtggatcac
541 ctacatgaga tcaaatgcca tcctccagat ccaccaagtg gttctcagcc agaacctcta
601 cttggaaatt tatatcaaac ata
Seq ID: 233
>chzn_ch-v-045_291T>C
1 caggcctggc acagagtcag tgctccataa atattttgtt aaacgatgga tggtgagtgc
61 ttttactatc cagtatttac ccagcttata gattaagtat gaagagttca agatacatgg
121 tgttaagagt cgtttttata tgcttgcaaa gcatttttgt catatttttt ctactttgct
181 tccatctttt cttctttcac ttcatttatt aattctccat atgcttgttt aactattgta
241 gATCCCCTTG AAATTAGACA CGCAAGGACT TCTTCAACCA GAAAAACCCA gTGTTCTAAA
301 GGTGGATTCA AGAGATGGAA CCCTAAGTGG AGAATGAgtt attctaagga tttctacttt
361 ggtcttcaag aaagctgtgc cccagaacac cagagatttc aacttagtca ataaaacctt
421 gaaataaaga tgggcttaat ctaatgtact gcatgagtag ttggtgattt tgtacattca
481 ttgagctctc ccagagtctg tgtagagtgt tgtgcattat gtagtataaa ggaggtgacc
541 aggtaagtga cagataggta gactcagctt ctctgcttct cataggacta cctctaccca
601 cctctagtta gcatta

Seq ID: 234

>peptide of ch-v-045_291T>C

1 MRFALMNMKL ALIRVLQNFS FKPCKETQIP LKLDTQGLLQ PEKPgVLKVD SRDGTLSGE

Seq ID: 235

>chzk_ch-v-065_563T>C

1 tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata
61 ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa
121 gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag
181 gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc
241 tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag
301 gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt
361 attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc
421 atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa
481 aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg
541 ggtgtgtgcg attctttgct ac[c]ggctgca gctgcagccc cacctccttc tccagcacat
601 aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca
661 gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc
721 aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca
781 gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt

Seq ID: 236
>chzl_ch-v-040_206C>T

1 ggagtgacct gattttccag gtgctgtctg tcaccccttt ctttgactag gaaagggaac
61 tccttgaccc cttgcgcttc tcaagtgagg caatgcctcg ccctgcttcg gcttgtgcac
121 agcacgctgc acccactgtc ctgcacccac tgtctggcac tccctagtga gatgaacccg
181 gtacctcaga tgaaaatgca gaaat[t]accc gtcttctgtg tcactcacac tgggagctgt
241 agaccggagc tgttcctatt cggccatctt ggctccaccc cccgagtttt ggcttttaat
301 tgaaagtgta ttgatgtggg aaggagataa tgccatgcat ttatgagcac atattagagg
361 gtctgagaca atgcatgtga taaaaggtca cctaaggaag aaaaaaagaa caagggaaga
421 cactggaaag aacgtgatgc tgggagtccc tgggccacca aagtctggag aaaagtggta
481 accacaaggc tcccagccta gtttcactga agacctcgac actaggtgac ttatgggatc
541 cttggtagga caagccttga aaagttttac aatagcaaat gtggacgttg tcagaaccaa
601 atgatgtcac gtgtgtattt gtgtgtgtgt gtcagtgtgt gtgtttaaaa atcatgacaa
661 ataaagcagg ctgtgaagag gggattccca tgctcgtgtg cctgataaca caactatcac
721 aaacgctttg cgaaaccaca agtttgcaca aaggcaatcc caaccttaca caaaa

>CYP3A5 insert in pKK233-2 (Seq ID 144)

1 ccATGgctct gttattagca gtttttCTGG TGCTCCTCTA TCTATATGGG ACCCGTACAC
61 ATGGACTTTT TAAGAGACTG GGAATTCCAG GGCCCACACC TCTGCCTTTG TTGGGAAATG
121 TTTTGTCCTA TCGTCAGGGT CTCTGGAAAT TTGACACAGA GTGCTATAAA AAGTATGGAA
181 AAATGTGGGG AACGTATGAA GGTCAACTCC CTGTGCTGGC CATCACAGAT CCCGACGTGA
241 TCAGAACAGT GCTAGTGAAA GAATGTTATT [c]tGTCTTCAC AAATCGAAGG TCTTTAGGCC
301 CAGTGGGATT TATGAAAAGT GCCATCTCTT TAGCTGAGGA TGAAGAATGG aagagaatac
361 GGTCATTGCT GTCTCCAACC TTCACCAGCG GAAAACTCAA GGAGATGTTC CCCATCATTG
421 CCCAGTATGG AGATGTATTG GTGAGAAACT TGAGGCGGGA AGCAGAGAAA GGCAAGCCTG
481 TCACCTTGAA AGACATCTTT GGGGCCTACA GCATGGATGT GATTAGTGGC ACATCATTTG
541 GAGTGAACAT CGACTCTCTC AACAATCCAC AAGACCCCTT TGTGGAGAGC ACTAAGAAGT
601 TCCTAAAATT TGGTTTCTTA GATCCATTAT TTCTCTCAAT AATACTCTTT CCATTCCTTA
661 CCCCAGTTTT TGAAGCATTA AATGTCTCTC TGTTTCCAAA AGATACCATA AATTTTTTAA
721 GTAAATCTGT AAACAGAATG AAGAAAAGTC GCCTCAACGA CAAACAAAAG CACCGACTAG
781 ATTTCCTTCA GCTGATGATT GACTCCCAGA ATTCGAAAGA AACTGAGTCC CACAAAGCTC
841 TGTCTGATCT GGAGCTCGCA GCCCAGTCAA TAATCTTCAT TTTTGCTGGC TATGAAACCA
901 CCAGCAGTGT TCTTTCCTTC ACTTTATATG AACTGGCCAC TCACCCTGAT GTCCAGCAGA
961 AACTGCAAAA GGAGATTGAT GCAGTTTTGC CCAATAAGGC ACCACCTACC TATGATGCCG
1021 TGGTACAGAT GGAGTACCTT GACATGGTGG TGAATGAAAC ACTCAGATTA TTCCCAGTTG
1081 CTATTAGACT TGAGAGGACT TGCAAGAAAG ATGTTGAAAT CAATGGGGTA TTCATTCCCA
1141 AAGGGTCAAT GGTGGTGATT CCAA[c]tTATG CTCTTCACCA TGACCCAAAG TACTGGACAG
1201 AGCCTGAGGA GTTCCGCCCT GAAAGGTTCA GTAAGAAGAA GGACAGCATA gatccttaca
1261 TATACACACC CTTTGGAACT GGACCCAGAA ACTGCATTGG CATGAGGTTT GCTCTCATGA
1321 ACATGAAACT TGCTCTAATC AGAGTCCTTC AGAACTTCTC CTTCAAACCT TGTAAAGAAA
1381 CACAGATCCC CTTGAAATTA GACACGCAAG GACTTCTTCA ACCAGAAAAA CCCATTGTTC
1441 TAAAGGTGGA TTCAAGAGAT GGAACCCTAA GTGGAGAAca tcaccatcac catcacTGAg
1501 atctgcag

<160> 239

<170> PatentIn Ver. 2.1

<210> 1 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 1 

acaggcacag aaacccacaa g 21 
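
Each entry in this listing declares its length under `<211>` and then gives the residues under `<400>`, so the two can be cross-checked mechanically. The following is only a minimal sketch under stated assumptions: the helper name `check_entry` is hypothetical, and it assumes an entry is available as plain text with the `<211>` and `<400>` fields intact and with the trailing position number on the sequence line, as in the entry for SEQ ID NO 1 above.

```python
import re


def check_entry(entry_text: str) -> tuple[int, int, bool]:
    """Return (declared_length, counted_length, match) for one listing entry.

    Hypothetical helper: assumes <211> holds the declared length and that
    everything after the <400> line is sequence data.
    """
    declared = int(re.search(r"<211>\s*(\d+)", entry_text).group(1))
    # Keep only the text after the "<400> n" line.
    body = entry_text.split("<400>", 1)[1]
    body = body.split("\n", 1)[1] if "\n" in body else ""
    # Count nucleotide letters only; digits and spaces are layout, not sequence.
    counted = len(re.findall(r"[acgtnACGTN]", body))
    return declared, counted, declared == counted


entry = """<210> 1
<211> 21
<212> DNA
<213> Homo sapiens
<400> 1
acaggcacag aaacccacaa g 21
"""

print(check_entry(entry))  # (21, 21, True)
```

A mismatch between the two numbers would point to a dropped or duplicated residue in the transcription rather than to a property of the sequence itself.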



<210> 2 
<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 2 

atcgccactt gccttcttc 19 



<210> 3 

<211> 19 

<212> DNA 

<213> Homo sapiens 



<400> 3 

ccctgcttcg gcttgtgca 19



<210> 4

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 4 

cacagcctgc tttatttgtc atga 24 



<210> 5 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gatccttggt aggacaagcc t 21 



<210> 6 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 6 

caagcactga tttggtcact tcct 24 



<210> 7 
<211> 22 
<212> DNA 

<213> Homo sapiens 

<400> 7 

gggatgggac cgtaagtgga ac 22 



<210> 8 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<400> 8 

taatcacatt ggagttctga caaatg 26 



<210> 9 
<211> 30 

<212> DNA 

<213> Homo sapiens 



<400> 9 

aaaaacctct tacaaaagta tcatcggata 



30 



<210> 10 

<211> 27 

<212> DNA 

<213> Homo sapiens 



<400> 10 

cctactaggt ctctgacttg gaaccat 27



<210> 11

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 11 

gccgagacgc accattacac t 



21 



<210> 12 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 12 

cacccatccc ttcccactca t 



21 



<210> 13 

<211> 28 

<212> DNA 

<213> Homo sapiens 



<400> 13 

tgatggttcc aagtcagaga cctagtag 



28 



<210> 14 

<211> 35 

<212> DNA 

<213> Homo sapiens 



<400> 14 

aattgtagac atctttctct taagttaatt cccag 



35 



<210> 15 

<211> 25 

<212> DNA 

<213> Homo sapiens 



<400> 15 

tctgcatgcc aacagtgaac aatct 



25 



<210> 16 
<211> 19 

<212> DNA 

<213> Homo sapiens 



<400> 16 

ggcacgcacc agcatgtcc 



19 



<210> 17 

<211> 19 

<212> DNA 

<213> Homo sapiens 



<400> 17 

ctggctgagt gccgtggct 19



<210> 18
<211> 25 
<212> DNA 

<213> Homo sapiens 



<400> 18 

tgagcgcttc atgtattctg gctat 



25 



<210> 19 

<211> 31 
<212> DNA 

<213> Homo sapiens 



<400> 19 

aaatattttc aaagtcacac tctgacaaca g 31



<210> 20
<211> 29

<212> DNA
<213> Homo sapiens

<400> 20

taacaggatc tcatgctttt ttcatggct 29



<210> 21 
<211> 30 
<212> DNA 

<213> Homo sapiens 
<400> 21 

cactccaata ttcacaatag ccactattca 30 



<210> 22 

<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 22 

actcctacgt atccttccaa gccc 24



<210> 23
<211> 28
<212> DNA

<213> Homo sapiens
<400> 23

gctaagggaa acaggcatag aaacttac 28



<210> 24
<211> 18
<212> DNA
<213> Homo sapiens

<400> 24

ggagcttccc tgccctgc 18



<210> 25

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 25 

tccttctcca gcacataaat c 21



<210> 26
<211> 21
<212> DNA
<213> Homo sapiens

<400> 26

aaattagaag gtggatggga g 21



<210> 27
<211> 21
<212> DNA
<213> Homo sapiens

<400> 27

gagtaactca ccagccctct g 21



<210> 28
<211> 21
<212> DNA
<213> Homo sapiens

<400> 28

aaacctcaga actccctccc a 21



<210> 29
<211> 23
<212> DNA
<213> Homo sapiens

<400> 29

gacatctctg aatagcttcc ttc 23



<210> 30
<211> 21
<212> DNA
<213> Homo sapiens

<400> 30

gcacatagtt tataacggca a 21



<210> 31 
<211> 23 
<212> DNA 



<213> Homo sapiens 



<400> 31 

agaacctaag gttgctgtgt gtc 23



<210> 32
<211> 21
<212> DNA
<213> Homo sapiens

<400> 32

tgcaagatgt taccactggg c 21



<210> 33
<211> 20
<212> DNA
<213> Homo sapiens

<400> 33

cgccccacat acactcagaa 20



<210> 34
<211> 21
<212> DNA
<213> Homo sapiens

<400> 34

agaccatttt taggaagctc g 21



<210> 35
<211> 22
<212> DNA
<213> Homo sapiens

<400> 35

caaggggtag tccactgagt tc 22



<210> 36
<211> 18
<212> DNA
<213> Homo sapiens

<400> 36

ctctttggag ttgcagcg 18



<210> 37
<211> 21
<212> DNA
<213> Homo sapiens

<400> 37

aggtgagtct aactcagctt g 21



<210> 38 
<211> 21 
<212> DNA 



<213> Homo sapiens 



<400> 38 

gacagctaaa gtgtgtgagg g 21



<210> 39
<211> 21
<212> DNA
<213> Homo sapiens

<400> 39

aatgggttcc agttgagaat c 21



<210> 40
<211> 21
<212> DNA
<213> Homo sapiens

<400> 40

attgttgtgc ctgatttcaa g 21



<210> 41
<211> 19
<212> DNA
<213> Homo sapiens

<400> 41

agaagccata gggaggttg 19



<210> 42
<211> 20
<212> DNA
<213> Homo sapiens

<400> 42

gactgtcctc caagcattct 20



<210> 43
<211> 21
<212> DNA
<213> Homo sapiens

<400> 43

gatgccatga tgaggagtgt g 21



<210> 44
<211> 19
<212> DNA
<213> Homo sapiens

<400> 44

accagggcca gcaatattg 19



<210> 45 
<211> 25 
<212> DNA 



<213> Homo sapiens 



<400> 45 

aaatacttca cgaatactat gatca 25



<210> 46

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 46 

cagggacata attgattatc tttg 



24 



<210> 47 

<211> 19 

<212> DNA 

<213> Homo sapiens 



<400> 47 

tactggttgg gaggtggag 



19 



<210> 48 
<211> 23 

<212> DNA 

<213> Homo sapiens 



<400> 48 

catgatgttc ttaatgctac agg 



23 



<210> 49 

<211> 25 

<212> DNA 

<213> Homo sapiens 



<400> 49 

gaagagttca agatacatgg tgtta 



25 



<210> 50 

<211> 23 

<212> DNA 

<213> Homo sapiens 



<400> 50 

tgcacaacac tctacacaga ctc 23



<210> 51 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 51 
tgggcttgca a 



11 



<210> 52 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 52 
tgggcgtgca a 11



<210> 53

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 53 
gcatgggtaa a 



11 



<210> 54 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 54 
gcatgagtaa a 



11 
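
The 11-residue entries in this part of the listing come in pairs that differ at a single position (for example SEQ ID NOs 53 and 54). As an illustration only, the sketch below locates that position by direct comparison. The function name `substitutions` is hypothetical, and treating the first sequence of a pair as the reference allele and the second as the variant is an assumption made for the example; the listing itself does not label them.

```python
def substitutions(ref: str, alt: str) -> list[tuple[int, str, str]]:
    """List (1-based position, ref base, alt base) for every mismatch.

    Hypothetical helper: both inputs are taken as written in the listing,
    so the grouping spaces are removed before comparing.
    """
    ref = ref.replace(" ", "").lower()
    alt = alt.replace(" ", "").lower()
    if len(ref) != len(alt):
        # Pairs of unequal length (insertions/deletions) need an alignment,
        # which this sketch does not attempt.
        raise ValueError("sequences differ in length")
    return [(i + 1, r, a) for i, (r, a) in enumerate(zip(ref, alt)) if r != a]


# SEQ ID NO 53 and SEQ ID NO 54 as given above.
print(substitutions("gcatgggtaa a", "gcatgagtaa a"))  # [(6, 'g', 'a')]
```

Pairs of unequal length, such as SEQ ID NOs 81 and 82, raise `ValueError` here by design, since a position-by-position comparison is not meaningful for them.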



<210> 55 
<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 55 

ggggtgtgtg c 



11 



<210> 56 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 56 

ggggtatgtg c 



11 



<210> 57 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 57 
tgtgcgattc t 



11 



<210> 58 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 58 
tgtgcaattc t 



11 



<210> 59 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 59 
gccccacctc c 11



<210> 60

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 60 

gccccgcctc c 11 



<210> 61 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 61 

ctcacactgg g 11 



<210> 62 
<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 62 

ctcacgctgg g 11 



<210> 63 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 63 

gagacgcacc a 11 



<210> 64 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 64 

gagacacacc a 11 



<210> 65 
<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 65 

tgtgtgtggg a 11 



<210> 66 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 66 
tgtgtatggg a 11



<210> 67
<211> 11
<212> DNA
<213> Homo sapiens

<400> 67

atccatgtat a 11



<210> 68
<211> 11
<212> DNA
<213> Homo sapiens

<400> 68

atccacgtat a 11



<210> 69
<211> 11
<212> DNA
<213> Homo sapiens

<400> 69

catcttaccc c 11



<210> 70
<211> 11
<212> DNA
<213> Homo sapiens

<400> 70

catctcaccc c 11



<210> 71
<211> 11
<212> DNA
<213> Homo sapiens

<400> 71

tctattgcta t 11



<210> 72
<211> 11
<212> DNA
<213> Homo sapiens

<400> 72

tctatagcta t 11



<210> 73 
<211> 11 
<212> DNA 



<213> Homo sapiens 



<400> 73 
ggcagggaag c 11



<210> 74
<211> 11
<212> DNA
<213> Homo sapiens

<400> 74

ggcagagaag c 11



<210> 75
<211> 11
<212> DNA
<213> Homo sapiens

<400> 75

ccaggcaaac a 11



<210> 76
<211> 11
<212> DNA
<213> Homo sapiens

<400> 76

ccaggtaaac a 11



<210> 77
<211> 11
<212> DNA
<213> Homo sapiens

<400> 77

tcaaggagaa g 11



<210> 78
<211> 11
<212> DNA
<213> Homo sapiens

<400> 78

tcaagaagta g 11



<210> 79
<211> 11
<212> DNA
<213> Homo sapiens

<400> 79

gtacacatgg a 11



<210> 80 
<211> 11 
<212> DNA 



<213> Homo sapiens 



<400> 80 
gtacatatgg a 11



<210> 81
<211> 10
<212> DNA
<213> Homo sapiens

<400> 81

catggacttt 10



<210> 82
<211> 11
<212> DNA
<213> Homo sapiens

<400> 82

catgggactt t 11



<210> 83
<211> 11
<212> DNA
<213> Homo sapiens

<400> 83

gatagcaggc c 11



<210> 84
<211> 11
<212> DNA
<213> Homo sapiens

<400> 84

gatagtaggc c 11



<210> 85
<211> 11
<212> DNA
<213> Homo sapiens

<400> 85

agaatcgggc t 11



<210> 86
<211> 11
<212> DNA
<213> Homo sapiens

<400> 86

agaatagggc t 11



<210> 87 
<211> 11 
<212> DNA 



<213> Homo sapiens 



<400> 87 
ttattctgtc t 11



<210> 88

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 88 
ttattatgtc t 



11 



<210> 89 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 89 
gtacaggaag g 



11 



<210> 90 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 90 
cctaaaaaat g 



11 



<210> 91 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 91 
tcttttatct t 



11 



<210> 92 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 92 
tctttgatct t 



11 



<210> 93 
<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 93 
gagtctgcac a 



11 



<210> 94 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 94 
gagtccgcac a 11



<210> 95
<211> 11 
<212> DNA 

<213> Homo sapiens 



<400> 95 
gaagtcaaga a 



11 



<210> 96 

<211> 11 
<212> DNA 

<213> Homo sapiens 



<400> 96 
agtcgtcaag a 



11 



<210> 97 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 97 
aggaagtatt c 



11 



<210> 98 
<211> 11 
<212> DNA 

<213> Homo sapiens 



<400> 98 
aggaattatt c 



11 



<210> 99 

<211> 11 
<212> DNA 

<213> Homo sapiens 



<400> 99 
agagagcttc a 



11 



<210> 100 
<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 100 
agagaacttc a 



11 



<210> 101 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 101 
cttcaatagt a 11



<210> 102

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 102 
cttcagtagt a 



11 



<210> 103 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 103 
tccaacttat g 11



<210> 104 
<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 104 
tccaaattat g 



11 



<210> 105 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 105 
cgaaactaca t 



11 



<210> 106 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 106 
cgaaattaca t 



11 



<210> 107 
<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 107 
aaggatttct a 



11 



<210> 108 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 108 
aaggacttct a 11



<210> 109

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 109 
agctccgttg t 



11 



<210> 110 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 110 
agctctgttg t 



11 



<210> 111 
<211> 10 

<212> DNA 

<213> Homo sapiens 



<400> 111 
cacctaccta 



10 



<210> 112 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 112 
caccttacct a 



11 



<210> 113 
<211> 830 
<212> DNA 
<213> Homo sapiens 



<400> 113 

tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 60
ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 120
gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 180
gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 240
tgtctggctg ggcgtgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 300
gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 360
attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 420
atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 480
aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 540
gcftgtgtgcg attctttgct actggctgca gctgcagccc cgcctccttc tccagcacat 600
aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 660
gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc 720
aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 780
gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 830



<210> 114
<211> 830
<212> DNA
<213> Homo sapiens

<400> 114

tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 60
ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 120
gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 180
gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 240
tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 300
gcagagggaa ttgcatgagt aaagatctgc agttgtggct tgtggggatg gatttcaagt 360
attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 420
atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 480
aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 540
ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat 600
aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 660
gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc 720
aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 780
gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 830



<210> 115 
<211> 830 
<212> DNA 
<213> Homo sapiens 



<400> 115 

tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 60
ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 120
gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 180
gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 240
tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 300
gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 360
attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 420
atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 480
aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 540
ggtatgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat 600
aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 660
gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc 720
aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 780
gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 830



<210> 116 
<211> 830 
<212> DNA 
<213> Homo sapiens 



<400> 116 

tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 60
ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 120
gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 180
gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 240
tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 300
gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 360
attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 420
atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 480
aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 540
ggtgtgtgca attctttgct actggctgca gctgcagccc cacctccttc tccagcacat 600
aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 660
gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc 720
aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 780
gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 830



<210> 117
<211> 775
<212> DNA
<213> Homo sapiens

<400> 117

ggagtgacct gattttccag gtgctgtctg tcaccccttt ctttgactag gaaagggaac 60
tccttgaccc cttgcgcttc tcaagtgagg caatgcctcg ccctgcttcg gcttgtgcac 120
agcacgctgc acccactgtc ctgcacccac tgtctggcac tccctagtga gatgaacccg 180
gtacctcaga tgaaaatgca gaaatcaccc gtcttctgtg tcactcacgc tgggagctgt 240
agaccggagc tgttcctatt cggccatctt ggctccaccc cccgagtttt ggcttttaat 300
tgaaagtgta ttgatgtggg aaggagataa tgccatgcat ttatgagcac atattagagg 360
gtctgagaca atgcatgtga taaaaggtca cctaaggaag aaaaaaagaa caagggaaga 420
cactggaaag aacgtgatgc tgggagtccc tgggccacca aagtctggag aaaagtggta 480
accacaaggc tcccagccta gtttcactga agacctcgac actaggtgac ttatgggatc 540
cttggtagga caagccttga aaagttttac aatagcaaat gtggacgttg tcagaaccaa 600
atgatgtcac gtgtgtattt gtgtgtgtgt gtcagtgtgt gtgtttaaaa atcatgacaa 660
ataaagcagg ctgtgaagag gggattccca tgctcgtgtg cctgataaca caactatcac 720
aaacgctttg cgaaaccaca agtttgcaca aaggcaatcc caaccttaca caaaa 775



<210> 118 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<400> 118 

taatgaaaat aagaattatt ttgatggctc taacagtgac atttatatca tctgttttat 60
ctggagcatt ctataataag tttatattaa gcaaatcaat aaaaacctct tacaaaagta 120
tcatcggata ctttcctgaa cattaaggag aaatctatag aactgaatga atgagaacca 180
acaagtaaat atatgtgatc attgtaacca ttgttggtgt ggggcatttg tcagaactcc 240
aatgtgatta ttaacatagg tgagaattaa tccactgtga ctttgcccat tgcttagaaa 300
gaacattcat agtttaatta tgcctttttt gaccaagcac agtggctcat gcctgtaatc 360
ccagcacttt gggaggccga ggtgggtgga tcacctgagg tcaggagttc gagaccagcc 420
tgaccaacat ggtgaaaccc catctctact aaaaatacaa aaattagcta ggtgtggtgg 480
tatgcaccta taatctcagc tacccaggag gctgaggcag gagaatcact tgaacctgga 540
ggcagaggtt gcagtgagcc gagacacacc attacactcc agcctgggtg acagagtgag 600
attccatctc aaaaaaaaaa aaaaaaaatt atgccttttt gaagcacata cattttataa 660
catacaactg aatcccttat tatattatta gttttgattt aatgttttca aaccatctcc 720
cctgatattt ctgggagatg ggaaacatgt tttcttacac ctcttgcatt ccattctcaa 780
ctcccaactg tcttactgca atgaacactt aataagaaac agtcaattgg tcaattgatt 840
gggcaacagg ctaaacacac tcattccttg tctgttccca cttctttctt tactttccct 900
tcctgagtaa cttatcctaa agtcattagg tgggtggcag ccagatggtg gccacacatt 960
aaggtagaaa agagagtgtc atgatggttc caagtcagag acctagtagg gtgaggatca 1020
agtaggtgtt cacgtggaga aacagcccgg cctgtgtgtg ggagtccaag caagcagaga 1080
aaatgtcgac acagaggggt ggcctgaaaa 1110



<210> 119 
<211> 837 
<212> DNA 
<213> Homo sapiens 



<400> 119 

caaaaattag ctaggtgtgg tggtatgcac ctataatctc agctacccag gaggctgagg 60
caggagaatc acttgaacct ggaggcagag gttgcagtga gccgagacgc accattacac 120
tccagcctgg gtgacagagt gagattccat ctcaaaaaaa aaaaaaaaaa attatgcctt 180
tttgaagcac atacatttta taacatacaa ctgaatccct tattatatta ttagttttga 240
tttaatgttt tcaaaccatc tcccctgata tttctgggag atgggaaaca tgttttctta 300
cacctcttgc attccattct caactcccaa ctgtcttact gcaatgaaca cttaataaga 360
aacagtcaat tggtcaattg attgggcaac aggctaaaca cactcattcc ttgtctgttc 420
ccacttcttt ctttactttc ccttcctgag taacttatcc taaagtcatt aggtgggtgg 480
cagccagatg gtggccacac attaaggtag aaaagagagt gtcatgatgg ttccaagtca 540
gagacctagt agggtgagga tcaagtaggt gttcacgtgg agaaacagcc cggcctgtgt 600
gtgggagtcc aagcaagcag agaaaatgtc gacacagagg ggtggcctga aaaagcagcc 660
agagcctaaa cagggcatgg agaacatatt tagggcatga ggtgaggagg gcatccatga 720
gtgggaaggg atgggtgagg tttcactaca taaaggggat tgatgaaata agtaaataaa 780
gtatactgga agccaggtgt gtcacttttg cagaaaagag tcatggattc agaaagg 837



<210> 120 

<211> 1197 

<212> DNA 

<213> Homo sapiens 



<400> 120 

ttctttcttt actttccctt cctgagtaac ttatcctaaa gtcattaggt gggtggcagc 60
cagatggtgg ccacacatta aggtagaaaa gagagtgtca tgatggttcc aagtcagaga 120
cctagtaggg tgaggatcaa gtaggtgttc acgtggagaa acagcccggc ctgtgtgtgg 180
gagtccaagc aagcagagaa aatgtcgaca cagaggggtg gcctgaaaaa gcagccagag 240
cctaaacagg gcatggagaa catatttagg gcatgaggtg aggagggcat ccatgagtgg 300
gaagggatgg gtgaggtttc actacataaa ggggattgat gaaataagta aataaagtat 360
actggaagcc aggtgtgtca cttttgcaga aaagagtcat ggattcagaa agggagaaaa 420
ctagcaggaa tcctatgaaa ttagattaaa atggatgtat ccacgtatat tcataccctt 480
ctagatagat aaatggttag ataggtgata aaaagataac aagaggacaa gataattaga 540
tagacataaa tgtatgtatg tgtttgtgtg tgtgtacaaa aaaacatata ctccctactt 600
ctctccactg atagggctag gtaacaatgg catttcaata gcaatgagca cacttagtgg 660
ccagatcttg gcttattaat accattttcc actgaaagga accagagctt tttagagaaa 720
tggctgattc cagggccagg attaagaatg ttcaagataa gcctaggata cattttgtgc 780
caggaagcaa gaagatgttc aaatgatttc caagtaatgt ttggaaatga tatttgaaaa 840
tgatttccaa atgatatttc caaatgattt ccaaatgata tatggaaaca cttaaagact 900
ccactaaaga actattagat ctgataaaca aattcagtaa tgttgctgga tacaaaatca 960
acatacaaaa accagtagca tttctgcatg ccaacagtga acaatctggc aaaaataaaa 1020
aatgtaatcc catttacaat aaccccaaat aaaactaaat acctgggaat taacttaaga 1080
gaaagatgtc tacaattaat attgtaaaac actgatgaag gaaattgaag aagacacaaa 1140
aaagaaggat attccatgtt tatatattgt aagcattaat attgttaaaa atgtcca 1197



<210> 121 
<211> 817 

<212> DNA 

<213> Homo sapiens 



<400> 121 

caaagcaaaa atggacaaat ggatcagatc aagttaaaaa gcttctgtac cacaaagaaa 60
gcaatcaaca aagtgaagac acaaaccaca gaatgggaga aaatattttc aaagtcacac 120
tctgacaaca gattaatagc cagaatacat gaagcgctca aacaactctg taaggaaaaa 180
tctaataatc caatcaaaaa atgggcaaaa tttgaataga catttttcaa aagaagacat 240
acaaatgcca cataggcata tgataaggtg ctcaacatca ctggtcatta gagaaatgca 300
aatcaaaacc acaatgagat atcatctcac cccagctaaa atggttttta tccaaaagac 360
aggcaacaac aaatgccagc gagaatgtgg agaaaaggga acccttgtac actgttggtg 420
taaattagtg caaccactat agagaacaat ttggaggttc ctcaaaacat taaaattaac 480
attaaataga gctaccacaa tatccagaaa tccccatgct gggtatatac ctggaagaaa 540
ggaaatcata tattgaagag ataacatcac tccaatattc acaatagcca ctattcacaa 600
atgccaagat ttggaagcaa cctaagtgtc catcaacaga tgaatggata aagaaagtac 660
tccaattata cacaatggag cacaattcag ccatgaaaaa agcatgagat cctgttatct 720
gtaataatat ggatggaact ggaggtcatc atgttaagtg aaataagcca ggcacagaaa 780
cacagatatt gcaagttctc acatacttgt gggatct 817



<210> 122 
<211> 1126 

<212> DNA 

<213> Homo sapiens 



<400> 122 

cattaaaatt aacattaaat agagctacca caatatccag aaatccccat gctgggtata 60
tacctggaag aaaggaaatc atatattgaa gagataacat cactccaata
ccactattca caaatgccaa gatttggaag caacctaagt gtccatcaac agatgaatgg 180
ataaagaaag tactccaatt atacacaatg gagcacaatt cagccatgaa aaaagcatga 240
gatcctgtta tctgtaataa tatggatgga actggaggtc atcatgttaa gtgaaataag 300
ccaggcacag aaacacagat attgcaagtt ctcacatact tgtgggatct acaaatcaaa 360
acaactgagc taatgtctgg gccttagtca gtgttgtacc caagtactgg gagcacagct 420
tttaaaatac atcatgaatg ctttaataca ggaatgaata gatgagaggc acaaactggt 480
tgggtgttct tctgatacac agtatcttcc ttgacagatt cagtacaact ctcaacaggt 540
aagtctcttc atgttatgtt accttatgag gaattaagtg gcagaacatg atttctatta 600
ttttcctttg cagaacaaga ccaactttat tagttgggac acagtgtggc tgcatttgag 660
tcccaagcaa ccattagtct atagctatca ccacagagtc agaggggatg agacgcccag 720
caatctcacc caagacaact ccaccaacat tcctggttac ccaccatgtg tacagtaccc 780
tgctaggaac cagggtcatg aaagtaaata ataccagact gtgcccttga ggagctcacc 840
tctgctaagg gaaacaggca tagaaactta caatggtggt agagagaaaa gaggacaata 900
ggactgtgtg agggggatag gaggcaccca gaggaggaaa tggttacatt tgtgtgagga 960
ggttggtaag gaaaaatttt agcagaaggg gtctgtctgg ctgggcttgg aaggatacgt 1020
aggagtcatc tagagggcac aggtacactc caggcagagg gaatttcgtg ggtaaagatg 1080
tgtaggtgtg gcttgtgagg atggatttca attattctag aatgaa 1126



<210> 123 
<211> 624 
<212> DNA 
<213> Homo sapiens 



<400> 123 

gattaagctt ttcatgattc ctcatagaac atgaactcaa aagaggtcag caaaggggtg 60
tgtgcgattc tttgctattg gctgcagcta tagccctgcc tccttctcca gcacataaat 120
ctttcagcag cttggctgaa gactgctgtg cagggcagag aagctccagg caaacagccc 180
agcaaacagc agcactcagc taaaaggaag actcacagaa cacagttgaa gaaggaaagt 240
ggcgatggac ctcatcccaa atttggcggt ggaaacctgg cttctcctgg ctgtcagcct 300
ggtgctcctc tatctgtgag taactgtcca aactcctctc tttgtttcct tggacttggg 360
gtgctaatcg ggcccctttt cccttatctg ttttgaagat caaaagagat gttcaaggag 420
aagtagctga agtgttggac gctacaaacg catagaagtt attattatct tatgcagatc 480
tatgaatgaa taaataagca tttctcccat ccaccttcta attttggtga ctaggagggt 540
ttagggacag catttggtag tgggaatgat ttgattagct tagatctgac gaagactaat 600
caatgaaaac atggcagcgg caga 624



<210> 124 
<211> 624 
<212> DNA 
<213> Homo sapiens 



<400> 124 

gattaagctt ttcatgattc ctcatagaac atgaactcaa aagaggtcag caaaggggtg 60
tgtgcgattc tttgctattg gctgcagcta tagccctgcc tccttctcca gcacataaat 120
ctttcagcag cttggctgaa gactgctgtg cagggcaggg aagctccagg taaacagccc 180
agcaaacagc agcactcagc taaaaggaag actcacagaa cacagttgaa gaaggaaagt 240
ggcgatggac ctcatcccaa atttggcggt ggaaacctgg cttctcctgg ctgtcagcct 300
ggtgctcctc tatctgtgag taactgtcca aactcctctc tttgtttcct tggacttggg 360
gtgctaatcg ggcccctttt cccttatctg ttttgaagat caaaagagat gttcaaggag 420
aagtagctga agtgttggac gctacaaacg catagaagtt attattatct tatgcagatc 480
tatgaatgaa taaataagca tttctcccat ccaccttcta attttggtga ctaggagggt 540
ttagggacag catttggtag tgggaatgat ttgattagct tagatctgac gaagactaat 600
caatgaaaac atggcagcgg caga 624



<210> 125 
<211> 621 
<212> DNA 
<213> Homo sapiens

<400> 125

gattaagctt ttcatgattc ctcatagaac atgaactcaa aagaggtcag caaaggggtg 60
tgtgcgattc tttgctattg gctgcagcta tagccctgcc tccttctcca gcacataaat 120
ctttcagcag cttggctgaa gactgctgtg cagggcaggg aagctccagg caaacagccc 180
agcaaacagc agcactcagc taaaaggaag actcacagaa cacagttgaa gaaggaaagt 240
ggcgatggac ctcatcccaa atttggcggt ggaaacctgg cttctcctgg ctgtcagcct 300
ggtgctcctc tatctgtgag taactgtcca aactcctctc tttgtttcct tggacttggg 360
gtgctaatcg ggcccctttt cccttatctg ttttgaagat caaaagagat gttcaagaag 420
tagctgaagt gttggacgct acaaacgcat agaagttatt attatcttat gcagatctat 480
gaatgaataa ataagcattt ctcccatcca ccttctaatt ttggtgacta ggagggttta 540
gggacagcat ttggtagtgg gaatgatttg attagcttag atctgacgaa gactaatcaa 600
tgaaaacatg gcagcggcag a 621



<210> 126 
<211> 465 

<212> DNA 

<213> Homo sapiens 



<400> 126 

aggaaaggac ctgatgagtg aatgcaatta ctgatgttgg agttgctgtt attatttatc 60
gtgtacatat tacctccctc tcttgaccat tccagttcct gagtaactca ccagccctct 120
gatctataaa gtcacaatcc ctgtgacctg atttctgttt cactttgtag atatgggacc 180
cgtacatatg ggacttttta agagactggg aattccaggg cccacacctc tgcctttgtt 240
gggaaatgtt ttgtcctatc gtcaggtgag ttgcttgagc ttcctctttt gcttcttatg 300
gttgcaaaca tcagcttagt tccatcagta aaaatgcccc tccttgggag ggagttctga 360
ggtttcacat tttcagaaat ggtgggactg ggtgcagtgg atcatgcctg taatctcagc 420
ctctgtgagg ccaagactgg caaattgctt gagcccagga gtttg 465



<210> 127 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 127 

Met Asp Leu Ile Pro Asn Leu Ala Val Glu Thr Trp Leu Leu Leu Ala
1 5 10 15

Val Ser Leu Val Leu Leu Tyr Leu Tyr Gly Thr Arg Thr Tyr Gly Thr 
20 25 30 

Phe 



<210> 128 
<211> 602 
<212> DNA 
<213> Homo sapiens 



<400> 128 

catagacaag ggtgagtcct tcagtactta gagaaaattc aagagtgact ttaaattccc 60
cacttcaaat atattctctg ttttcttgtc tttcccttaa gacatctctg aatagcttcc 120
ttcaactgcc agtgaaagat agtaggcctg atttcattgg acgcaactgt tttcagcccc 180
aattagaggt agggtttatt ctatttaaaa taataatcaa cttgtatttt gtttcctctc 240
ccagggtctc tggaaatttg acacagagtg ctataaaaag tatggaaaaa tgtgggggtg 300
agtattctga aaacctccat tggatagacc tgctactgtg aggaggttac cccactgcag 360
gatagtctct gcccaggtct tcatgggatg aagctcttgt caacctaaat acaaacagag 420
agaggttctc tgaaagaaga ggataattac ttgggagtag aatattgcaa tgggaatctg 480
cttgccgtta taaactatgt gcaaattcag ggaggtaaac aagacaaaga tgctccatag 540
aaaatatgag aagaatctca taactgtttt gagataatta ttgttagcta caaagatcaa 600
ta 602



<210> 129
<211> 594 
<212> DNA 
<213> Homo sapiens 



<400> 129 
cagtatctct tccctgtttg gaccacatta cccttcatca tatgaagcct tgggtggctc 60
ctgtgtgaga ctcttgctgt gtgtcacacc ctaatgaact agaacctaag gttgctgtgt 120
gtcgtacaac taggggtatg gattacataa cataatgatc aaagtctggc ttcctgggtg 180
tggctccagc tgcagaatag ggctagtgaa gtttaatcag ctccgttgtc cccacacaga 240
acgtatgaag gtcaactccc tgtgctggcc atcacagatc ccgacgtgat cagaacagtg 300
ctagtgaaag aatgttattc tgtcttcaca aatcgaaggg taagcatcca ttttttgaaa 360
tttaaataat gattgatcca ctgattaaat ttttattttg aaaaaaacat atattcacag 420
aaggttacct aaaaaatgta caggaaggtt ccatgtactc ttcatcctgt cccgcccagt 480
ggtaacatct tgcaatcttg tatattgcaa tatatatcta gtatattcat attatcaggt 540
tggcacaaaa gttaaaatgg caaactacag gctgggcata atggctcatg cctg 594



<210> 130 
<211> 594 
<212> DNA 
<213> Homo sapiens 

<400> 130 

cagtatctct tccctgtttg gaccacatta cccttcatca tatgaagcct tgggtggctc 60
ctgtgtgaga ctcttgctgt gtgtcacacc ctaatgaact agaacctaag gttgctgtgt 120
gtcgtacaac taggggtatg gattacataa cataatgatc aaagtctggc ttcctgggtg 180
tggctccagc tgcagaatcg ggctagtgaa gtttaatcag ctctgttgtc cccacacaga 240
acgtatgaag gtcaactccc tgtgctggcc atcacagatc ccgacgtgat cagaacagtg 300
ctagtgaaag aatgttattc tgtcttcaca aatcgaaggg taagcatcca ttttttgaaa 360
tttaaataat gattgatcca ctgattaaat ttttattttg aaaaaaacat atattcacag 420
aaggttacct aaaaaatgta caggaaggtt ccatgtactc ttcatcctgt cccgcccagt 480
ggtaacatct tgcaatcttg tatattgcaa tatatatcta gtatattcat attatcaggt 540
tggcacaaaa gttaaaatgg caaactacag gctgggcata atggctcatg cctg 594



<210> 131 
<211> 594 
<212> DNA 
<213> Homo sapiens 



<400> 131 

cagtatctct tccctgtttg gaccacatta cccttcatca tatgaagcct tgggtggctc 60
ctgtgtgaga ctcttgctgt gtgtcacacc ctaatgaact agaacctaag gttgctgtgt 120
gtcgtacaac taggggtatg gattacataa cataatgatc aaagtctggc ttcctgggtg 180
tggctccagc tgcagaatcg ggctagtgaa gtttaatcag ctccgttgtc cccacacaga 240
acgtatgaag gtcaactccc tgtgctggcc atcacagatc ccgacgtgat cagaacagtg 300
ctagtgaaag aatgttatta tgtcttcaca aatcgaaggg taagcatcca ttttttgaaa 360
tttaaataat gattgatcca ctgattaaat ttttattttg aaaaaaacat atattcacag 420
aaggttacct aaaaaatgta caggaaggtt ccatgtactc ttcatcctgt cccgcccagt 480
ggtaacatct tgcaatcttg tatattgcaa tatatatcta gtatattcat attatcaggt 540
tggcacaaaa gttaaaatgg caaactacag gctgggcata atggctcatg cctg 594



<210> 132 

<211> 60 

<212> PRT 

<213> Homo sapiens 



<400> 132 

Met Trp Gly Thr Tyr Glu Gly Gln Leu Pro Val Leu Ala Ile Thr Asp
1 5 10 15

Pro Asp Val Ile Arg Thr Val Leu Val Lys Glu Cys Tyr Tyr Val Phe
20 25 30

Thr Asn Arg Arg Ser Leu Gly Pro Val Gly Phe Met Lys Ser Ala Ile
35 40 45

Ser Leu Ala Glu Asp Glu Glu Trp Lys Arg Ile Arg
50 55 60



<210> 133 
<211> 601 
<212> DNA 
<213> Homo sapiens 



<400> 133 

cagtatctct tccctgtttg gaccacatta cccttcatca tatgaagcct tgggtggctc 60
ctgtgtgaga ctcttgctgt gtgtcacacc ctaatgaact agaacctaag gttgctgtgt 120
gtcgtacaac taggggtatg gattacataa cataatgatc aaagtctggc ttcctgggtg 180
tggctccagc tgcagaatcg ggctagtgaa gtttaatcag ctccgttgtc cccacacaga 240
acgtatgaag gtcaactccc tgtgctggcc atcacagatc ccgacgtgat cagaacagtg 300
ctagtgaaag aatgttattc tgtcttcaca aatcgaaggg taagcatcca ttttttgaaa 360
tttaaataat gattgatcca ctgattaaat ttttattttg aaaaaaacat atattcacag 420
aaggttacct aaaaaatgta cctaaaaaat gaaggttcca tgtactcttc atcctgtccc 480
gcccagtggt aacatcttgc aatcttgtat attgcaatat atatctagta tattcatatt 540
atcaggttgg cacaaaagtt aaaatggcaa actacaggct gggcataatg gctcatgcct 600
g 601



<210> 134 
<211> 603 
<212> DNA 
<213> Homo sapiens 

<400> 134 

agcggaaaac tcaaggaggt atgaaaataa gatgagtctt aattagaaat gtaaagaatg 60
aatctgggga caggtagaaa gtaagatcac agtccgtttc caaggggtag tccactgagt 120
tcgagcttcc taaaaatggt ctttgatctt tatgtacaga aaagacatca caaaattcat 180
tacaaaatgt cacttactgc tccatgctgg agaaagccat atccttctgg gacttgagtc 240
tgcacattta actacaggta ctgatctgtt ttgtgcttag atgttcccca tcattgccca 300
gtatggagat gtattggtga gaaacttgag gcgggaagca gagaaaggca agcctgtcac 360
cttgaaagag taagtaggag cacagccatg gggttctgag ctgtcatgag cccttccagc 420
tgcctgccat ggagtcgaca gtcgcactgt tgggttactc cagtgaccag acaaaagcag 480
ggcagcgctg caactccaaa gagccaccta agagggagtg gctcccatga ggcggcaagt 540
cagcaaggga aaagggcctt ctctcctgtg cacaggagcc aggatttact tatctgttaa 600
ctt 603



<210> 135 
<211> 603 
<212> DNA 

<213> Homo sapiens

<400> 135 

agcggaaaac tcaaggaggt atgaaaataa gatgagtctt aattagaaat gtaaagaatg 60
aatctgggga caggtagaaa gtaagatcac agtccgtttc caaggggtag tccactgagt 120
tcgagcttcc taaaaatggt cttttatctt tatgtacaga aaagacatca caaaattcat 180
tacaaaatgt cacttactgc tccatgctgg agaaagccat atccttctgg gacttgagtc 240
cgcacattta actacaggta ctgatctgtt ttgtgcttag atgttcccca tcattgccca 300
gtatggagat gtattggtga gaaacttgag gcgggaagca gagaaaggca agcctgtcac 360
cttgaaagag taagtaggag cacagccatg gggttctgag ctgtcatgag cccttccagc 420
tgcctgccat ggagtcgaca gtcgcactgt tgggttactc cagtgaccag
ggcagcgctg caactccaaa gagccaccta agagggagtg gctcccatga ggcggcaagt 540
cagcaaggga aaagggcctt ctctcctgtg cacaggagcc aggatttact tatctgttaa 600
ctt 603



<210> 136 
<211> 626 
<212> DNA 
<213> Homo sapiens 



<400> 136 

attggacatg atagctagat ttgtttcagg aaaacatcct gctttccaag gatttagatg 60
aatgtttttg ttcactggtg actcaggtaa cacgtcttca agaagccata gggaggttga 120
gggagggaag tcgtcaagaa gggaggttga ggactgcact tttgatttac ttctgacttc 180
acgagtcact ttctgccaaa gaaatctctc cttttgcttc tagcaccgac tagatttcct 240
tcagctgatg attgactccc agaattcgaa agaaactgag tcccacaaag gtaaccaagg 300
agtgcttctg agggctactg gcggggacac taagagggag ggccttgttc tgaaaatgtg 360
caggaagtat tccaggaaga tgagaatttt tgccacatag cagaacaaca cacatttaga 420
tgttataaat ggtagctgga ggcactttcc agaagcccac aggtatagcc atgttccagg 480
ctgaaagggc aaccctaagc aaacctagaa tgcttggagg acagtcagtg gtttgtggat 540
cacctacatg agatcaaatg ccagttctca gcctcctcca gatccaccaa gtgagaacct 600
ctacttggaa atttatatca aacata 626



<210> 137 
<211> 623 
<212> DNA 
<213> Homo sapiens 



<400> 137 

attggacatg atagctagat ttgtttcagg aaaacatcct gctttccaag gatttagatg 60
aatgtttttg ttcactggtg actcaggtaa cacgtcttca agaagccata gggaggttga 120
gggagggaag tcaagaaggg aggttgagga ctgcactttt gatttacttc tgacttcacg 180
agtcactttc tgccaaagaa atctctcctt ttgcttctag caccgactag atttccttca 240
gctgatgatt gactcccaga attcgaaaga aactgagtcc cacaaaggta accaaggagt 300
gcttctgagg gctactggcg gggacactaa gagggagggc cttgttctga aaatgtgcag 360
gaattattcc aggaagatga gaatttttgc cacatagcag aacaacacac atttagatgt 420
tataaatggt agctggaggc actttccaga agcccacagg tatagccatg ttccaggctg 480
aaagggcaac cctaagcaaa cctagaatgc ttggaggaca gtcagtggtt tgtggatcac 540
ctacatgaga tcaaatgcca gttctcagcc tcctccagat ccaccaagtg agaacctcta 600
cttggaaatt tatatcaaac ata 623



<210> 138 
<211> 826 
<212> DNA 
<213> Homo sapiens 



<400> 138 

agataaagta cttttaggat cattcaaggc acacacccat aacactgagt atgtaagaca 60
gaaatgctct ctctggaaat tacagcagtg ctggtgctgg gatgccatga tgaggagtgt 120
gtggcccaca atcatgtaga ccttgggaaa acctggatta aaatgatttt gcgtcatcct 180
ggccctgtat aagatacata tcagaatgaa aaccactccc agtgtgactt tgaattgctt 240
ttccattttt tcttcttggg attagagaac ttcacttaga tttcatctaa gctgtgatgt 300
tgtacgttga cctgatttac ctaaaatgtc tttcctctcc tttcagctct gtctgatctg 360
gagctcgcag cccagtcaat aatcttcatt tttgctggct atgaaaccac cagcagtgtt 420
ctttccttca ctttatatga actggccact caccctgatg tccagcagaa actgcaaaag 480
gagattgatg cagttttgcc caataaggtg aggggatgac ccctggagat gaagggaaga 540
ggtgaagcct tagcaaaaat gcctcctcac cactccccag gagaattttt ataaaaagca 600
taatcactga ttccttcact gacataatgt aggaagcctc tgaggagaaa aacaaaggga 660
gaaacataga gaacggttgc tactggcaga agcataagat ctttgtacaa tattgctggc 720
cctggttcac ctgtttactg ttatcacaat aatgctaagt aaaaaaaaaa aaaaaaaaaa 780
aaaaaaaaaa aggagtgtgg cgagaagatg gccaaacagg aacagc 826



<210> 139 
<211> 795 
<212> DNA 
<213> Homo sapiens 



<400> 139 

gtccctgggg tgaggatggt cttgaatatc tcctacattc ataactcctc cacacatctc 60
agtaggtcac tgagcacatc aatggacatg ccagttatta aaatacttca cgaatactat 120
gatcatttac cagtatgagt tattctctgg agcttctaat acttcagtag tactgcatgg 180
actcagttga gagttaattc aaaatctcag attatccaat tctgtttctt tccttccagg 240
caccacctac ctatgatgcc gtggtacaga tggagtacct tgacatggtg gtgaatgaaa 300
cactcagatt attcccagtt gctattagac ttgagaggac ttgcaagaaa gatgttgaaa 360
tcaatggggt attcattccc aaagggtcaa tggtggtgat tccaacttat gctcttcacc 420
atgacccaaa gtactggaca gagcctgagg agttccgccc tgaaaggtac aagtctccag 480
ggaaatggag ctcaccctga cccaggctgg ttcaagcata ttctgcctct cttaatctac 540
atgacaatcg tgtggttgta caatcatttg cttgtaagtc tttttatcac aaaaaagtga 600
taattatcaa actttacaaa ccacagacta gaaaaaacga aactacatcc atccacagtc 660
ccagcacaag acaaagataa tcaattatgt ccctgtgggc atttttctac gcctatatag 720
atttttaaaa attagaatgg tatcactttt tatttggttt gaattgctgc ttacttgatt 780
taacaggaaa ctatc 795



<210> 140 
<211> 796 
<212> DNA 
<213> Homo sapiens 



<400> 140 

gtccctgggg tgaggatggt cttgaatatc tcctacattc ataactcctc cacacatctc 60
agtaggtcac tgagcacatc aatggacatg ccagttatta aaatacttca cgaatactat 120
gatcatttac cagtatgagt tattctctgg agcttctaat acttcaatag tactgcatgg 180
actcagttga gagttaattc aaaatctcag attatccaat tctgtttctt tccttccagg 240
caccacctta cctatgatgc cgtggtacag atggagtacc ttgacatggt ggtgaatgaa 300
acactcagat tattcccagt tgctattaga cttgagagga cttgcaagaa agatgttgaa 360
atcaatgggg tattcattcc caaagggtca atggtggtga ttccaactta tgctcttcac 420
catgacccaa agtactggac agagcctgag gagttccgcc ctgaaaggta caagtctcca 480
gggaaatgga gctcaccctg acccaggctg gttcaagcat attctgcctc tcttaatcta 540
catgacaatc gtgtggttgt acaatcattt gcttgtaagt ctttttatca caaaaaagtg 600
ataattatca aactttacaa accacagact agaaaaaacg aaactacatc catccacagt 660
cccagcacaa gacaaagata atcaattatg tccctgtggg catttttcta cgcctatata 720
gatttttaaa aattagaatg gtatcacttt ttatttggtt tgaattgctg cttacttgat 780
ttaacaggaa actatc 796



<210> 141 

<211> 59 

<212> PRT 

<213> Homo sapiens 



<400> 141

Ala Leu Ser Asp Leu Glu Leu Ala Ala Gln Ser Ile Ile Phe Ile Phe
1 5 10 15

Ala Gly Tyr Glu Thr Thr Ser Ser Val Leu Ser Phe Thr Leu Tyr Glu
20 25 30

Leu Ala Thr His Pro Asp Val Gln Gln Lys Leu Gln Lys Glu Ile Asp
35 40 45

Ala Val Leu Pro Asn Lys Ala Pro Pro Tyr Leu
50 55



<210> 142 
<211> 795 
<212> DNA 
<213> Homo sapiens 



<400> 142 

gtccctgggg tgaggatggt cttgaatatc tcctacattc ataactcctc cacacatctc 60
agtaggtcac tgagcacatc aatggacatg ccagttatta aaatacttca cgaatactat 120
gatcatttac cagtatgagt tattctctgg agcttctaat acttcaatag tactgcatgg 180
actcagttga gagttaattc aaaatctcag attatccaat tctgtttctt tccttccagg 240
caccacctac ctatgatgcc gtggtacaga tggagtacct tgacatggtg gtgaatgaaa 300
cactcagatt attcccagtt gctattagac ttgagaggac ttgcaagaaa gatgttgaaa 360
tcaatggggt attcattccc aaagggtcaa tggtggtgat tccaacttat gctcttcacc 420
atgacccaaa gtactggaca gagcctgagg agttccgccc tgaaaggtac aagtctccag 480
ggaaatggag ctcaccctga cccaggctgg ttcaagcata ttctgcctct cttaatctac 540
atgacaatcg tgtggttgta caatcatttg cttgtaagtc tttttatcac aaaaaagtga 600
taattatcaa actttacaaa ccacagacta gaaaaaacga aattacatcc atccacagtc 660
ccagcacaag acaaagataa tcaattatgt ccctgtgggc atttttctac gcctatatag 720
atttttaaaa attagaatgg tatcactttt tatttggttt gaattgctgc ttacttgatt 780
taacaggaaa ctatc 795



<210> 143 
<211> 616 
<212> DNA 
<213> Homo sapiens 



<400> 143 

caggcctggc acagagtcag tgctccataa atattttgtt aaacgatgga tggtgagtgc 60
ttttactatc cagtatttac ccagcttata gattaagtat gaagagttca agatacatgg 120
tgttaagagt cgtttttata tgcttgcaaa gcatttttgt catatttttt ctactttgct 180
tccatctttt cttctttcac ttcatttatt aattctccat atgcttgttt aactattgta 240
gatccccttg aaattagaca cgcaaggact tcttcaacca gaaaaaccca ttgttctaaa 300
ggtggattca agagatggaa ccctaagtgg agaatgagtt attctaagga cttctacttt 360
ggtcttcaag aaagctgtgc cccagaacac cagagatttc aacttagtca ataaaacctt 420
gaaataaaga tgggcttaat ctaatgtact gcatgagtag ttggtgattt tgtacattca 480
ttgagctctc ccagagtctg tgtagagtgt tgtgcattat gtagtataaa ggaggtgacc 540
aggtaagtga cagataggta gactcagctt ctctgcttct cataggacta cctctaccca 600
cctctagtta gcatta 616



<210> 144 

<211> 1508 

<212> DNA 

<213> Homo sapiens 



<400> 144 

ccatggctct gttattagca gtttttctgg tgctcctcta tctatatggg acccgtacac 60
atggactttt taagagactg ggaattccag ggcccacacc tctgcctttg ttgggaaatg 120
ttttgtccta tcgtcagggt ctctggaaat ttgacacaga gtgctataaa aagtatggaa 180
aaatgtgggg aacgtatgaa ggtcaactcc ctgtgctggc catcacagat cccgacgtga 240
tcagaacagt gctagtgaaa gaatgttatt ctgtcttcac aaatcgaagg tctttaggcc 300
cagtgggatt tatgaaaagt gccatctctt tagctgagga tgaagaatgg aagagaatac 360
ggtcattgct gtctccaacc ttcaccagcg gaaaactcaa ggagatgttc cccatcattg 420
cccagtatgg agatgtattg gtgagaaact tgaggcggga agcagagaaa ggcaagcctg 480
tcaccttgaa agacatcttt ggggcctaca gcatggatgt gattactggc acatcatttg 540
gagtgaacat cgactctctc aacaatccac aagacccctt tgtggagagc actaagaagt 600
tcctaaaatt tggtttctta gatccattat ttctctcaat aatactcttt ccattcctta 660
ccccagtttt tgaagcatta aatgtctctc tgtttccaaa agataccata aattttttaa 720
gtaaatctgt aaacagaatg aagaaaagtc gcctcaacga caaacaaaag caccgactag 780
atttccttca gctgatgatt gactcccaga attcgaaaga aactgagtcc cacaaagctc 840
tgtctgatct ggagctcgca gcccagtcaa taatcttcat ttttgctggc tatgaaacca 900
ccagcagtgt tctttccttc actttatatg aactggccac tcaccctgat gtccagcaga 960
aactgcaaaa ggagattgat gcagttttgc ccaataaggc accacctacc tatgatgccg 1020
tggtacagat ggagtacctt gacatggtgg tgaatgaaac actcagatta ttcccagttg 1080
ctattagact tgagaggact tgcaagaaag atgttgaaat caatggggta ttcattccca 1140
aagggtcaat ggtggtgatt ccaacttatg ctcttcacca tgacccaaag tactggacag 1200
agcctgagga gttccgccct gaaaggttca gtaagaagaa ggacagcata gatccttaca 1260
tatacacacc ctttggaact ggacccagaa actgcattgg catgaggttt gctctcatga 1320
acatgaaact tgctctaatc agagtccttc agaacttctc cttcaaacct tgtaaagaaa 1380
cacagatccc cttgaaatta gacacgcaag gacttcttca accagaaaaa cccattgttc 1440
taaaggtgga ttcaagagat ggaaccctaa gtggagaaca tcaccatcac catcactgag 1500
atctgcag 1508



<210> 145 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<400> 145 

Pro Val Ala Ile Arg Leu Glu Arg Thr Cys Lys Lys Asp Val Glu Ile
1 5 10 15

Asn Gly Val Phe Ile Pro Lys Gly Ser Met Val Val Ile Pro Asn Tyr
20 25 30

Ala Leu His His Asp Pro Lys Tyr Trp Thr Glu Pro Glu Glu Phe Arg
35 40 45

Pro Glu Arg Phe Ser Lys Lys Lys Asp Ser Ile Asp
50 55 60 



<210> 146 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 146 

tttcagtatc t 11 



<210> 147 
<211> 11 

<212> DNA 

<213> Homo sapiens 
<400> 147 

tttcaatatc t 11 



<210> 148 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 148 

tttaatagaa g 11 



<210> 149 
<211> 11
<212> DNA
<213> Homo sapiens

<400> 149

tttaacagaa g 11



<210> 150 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 150 

attccccata g 11 



<210> 151 
<211> 11 

<212> DNA
<213> Homo sapiens 

<400> 151 

attcctcata g 11 



<210> 152 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 152 

tagaatatga a 11 



<210> 153 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 153 

tagaacatga a 11



<210> 154 
<211> 11 

<212> DNA 

<213> Homo sapiens 
<400> 154 

gtaacttatc c 11 



<210> 155 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 155 

gtaacgtatc c 11 



<210> 156 
<211> 11
<212> DNA

<213> Homo sapiens

<400> 156

ttcacgtgga g 11



<210> 157 
<211> 11 
<212> DNA 

<213> Homo sapiens 



<400> 157 
ttcacatgga g 11



<210> 158 
<211> 11 
<212> DNA 

<213> Homo sapiens 



<400> 158 
tggacgcaac t 11



<210> 159 
<211> 11 
<212> DNA 

<213> Homo sapiens 



<400> 159 
tggacacaac t 11



<210> 160 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 160 
gaggataatt a 11



<210> 161 
<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 161 
gaggaaaatt a 11



<210> 162 

<211> 11 

<212> DNA 

<213> Homo sapiens 



<400> 162 
catcattgcc c 11



<210> 163 
<211> 11
<212> DNA
<213> Homo sapiens

<400> 163

catcactgcc c 11



<210> 164 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 164 

cagtcgcact g 11 



<210> 165 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 165 

cagtcacact g 11 



<210> 166 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 166 

actaagaagt t 11 



<210> 167 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 167 

actaaaaagt t 11 



<210> 168 
<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 168 

gatccattat t 11 



<210> 169 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 169 

gatccgttat t 11 



<210> 170 
<211> 11
<212> DNA
<213> Homo sapiens

<400> 170

caattccatt g 11



<210> 171 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 171 

caatttcatt g 11



<210> 172 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 172 

tgtcaatcta g 11 



<210> 173 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 173 

tgtcagtcta g 11 



<210> 174 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 174 

ttgtttgttt t 11 



<210> 175 
<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 175

ttgttcgttt t 11 



<210> 176 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 176 

aaataaagaa g 11 



<210> 177 
<211> 11
<212> DNA
<213> Homo sapiens

<400> 177

aaatacagaa g 11



<210> 178 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 178 

tttgcgtcat c 11 



<210> 179 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 179 

tttgcatcat c 11 



<210> 180 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 180 

ttgacctgat t 11 



<210> 181 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 181 

ttgacttgat t 11 



<210> 182 
<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 182 

tattgtagat c 11 



<210> 183 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 183 

tattgcagat c 11 



<210> 184 
<211> 11
<212> DNA
<213> Homo sapiens

<400> 184

ccttttccct t 11



<210> 185 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 185 

cctttcccct t 11 



<210> 186 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 186 

cttatgcaga t 11 



<210> 187 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 187 

cttatacaga t 11 



<210> 188 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 188 

aatacggtca t 11 



<210> 189 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 189 

aatacagtca t 11 



<210> 190 

<211> 11

<212> DNA

<213> Homo sapiens

<400> 190

ggaggtatga a 11



<210> 191 
<211> 11
<212> DNA

<213> Homo sapiens

<400> 191

ggaggcatga a 11



<210> 192 
<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 192 

agtccgtttc c 11 



<210> 193 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 193 

agtccatttc c 11 



<210> 194 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 194 

tctgccaaag a 11 



<210> 195 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 195 

tctgcgaaag a 11 



<210> 196 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 196 

acccattgtt c 11 



<210> 197 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 197 

acccactgtt c 11 



<210> 198 
<211> 11
<212> DNA
<213> Homo sapiens

<400> 198

gctactggct g 11



<210> 199 
<211> 11 

<212> DNA 

<213> Homo sapiens 
<400> 199 

gctaccggct g 11 



<210> 200 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 200 

gaaatcaccc g 11 



<210> 201 
<211> 11 
<212> DNA 

<213> Homo sapiens 
<400> 201 

gaaattaccc g 11 



<210> 202 

<211> 33 

<212> DNA 

<213> Homo sapiens 

<400> 202 

gctctactgt catttctaac cataatctct tta 33 



<210> 203 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 203 

gcttcatatg atgaagggta atgtggt 27 



<210> 204

<211> 18

<212> DNA

<213> Homo sapiens

<400> 204

tgtctttcag tatctctt 18



<210> 205 
<211> 19
<212> DNA

<213> Homo sapiens 
<400> 205 

tgtctttcaa tatctcttc 19 



<210> 206 
<211> 444 
<212> DNA 

<213> Homo sapiens 

<400> 206 

catttagtcc ttgtgagcac ttgatgattt acctgccttc aatttttcac tgacctaata 60
ttctttttga taatgaagta ttttaaacat ataaaacatt atggagagtg gcataggaga 120
tacccacgta tgtaccaccc agcttaacga atgctctact gtcatttcta accataatct 180
ctttaaagag ctcttttgtc tttcaatatc tcttccctgt ttggaccaca ttacccttca 240
tcatatgaag ccttgggtgg ctcctgtgtg agactcttgc tgtgtgtcac accctaatga 300
actagaacct aaggttgctg tgtgtcgtac aactaggggt atggattaca taacataatg 360
atcaaagtct ggcttcctgg gtgtggctcc agctgcagaa tcgggctagt gaagtttaat 420
cagctccgtt gtccccacac agaa 444



<210> 207 
<211> 830 
<212> DNA 

<213> Homo sapiens 



<400> 207 

tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 60
ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 120
gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 180
gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaac agaaggggtc 240
tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 300
gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 360
attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 420
atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 480
aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 540
ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat 600
aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 660
gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc 720
aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 780
gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 830



<210> 208 
<211> 830 
<212> DNA 
<213> Homo sapiens 

<400> 208 

tggtcaccca ccatgtgtac agtaccctgt tagggtccag ggtcatgaaa gtaaataata 60
ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 120
gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 180
gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 240
tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 300
gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 360
attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 420
atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 480
aactgattaa gttgtttatg attcctcata gaatatgaac tcaaaggagg taagcaaagg 540
ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat 600
aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 660
gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc 720
aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 780
gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 830



<210> 209 
<211> 830 

<212> DNA 

<213> Homo sapiens 



<400> 209 

tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 60
ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 120
gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 180
gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 240
tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 300
gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 360
attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 420
atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 480
aactgattaa gttgtttatg attccccata gaacatgaac tcaaaggagg taagcaaagg 540
ggtgtgtgcg attctttgct actggctgca gctgcagccc cacctccttc tccagcacat 600
aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 660
gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc 720
aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 780
gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 830



<210> 210 
<211> 837 
<212> DNA 

<213> Homo sapiens 



<400> 210 

caaaaattag ctaggtgtgg tggtatgcac ctataatctc agctacccag gaggctgagg 60
caggagaatc acttgaacct ggaggcagag gttgcagtga gccgagacgc accattacac 120
tccagcctgg gtgacagagt gagattccat ctcaaaaaaa aaaaaaaaaa attatgcctt 180
tttgaagcac atacatttta taacatacaa ctgaatccct tattatatta ttagttttga 240
tttaatgttt tcaaaccatc tcccctgata tttctgggag atgggaaaca tgttttctta 300
cacctcttgc attccattct caactcccaa ctgtcttact gcaatgaaca cttaataaga 360
aacagtcaat tggtcaattg attgggcaac aggctaaaca cactcattcc ttgtctgttc 420
ccacttcttt ctttactttc ccttcctgag taacgtatcc taaagtcatt aggtgggtgg 480
cagccagatg gtggccacac attaaggtag aaaagagagt gtcatgatgg ttccaagtca 540
gagacctagt agggtgagga tcaagtaggt gttcacgtgg agaaacagcc cggcctgtgt 600
gtgggagtcc aagcaagcag agaaaatgtc gacacagagg ggtggcctga aaaagcagcc 660
agagcctaaa cagggcatgg agaacatatt tagggcatga ggtgaggagg gcatccatga 720
gtgggaaggg atgggtgagg tttcactaca taaaggggat tgatgaaata agtaaataaa 780
gtatactgga agccaggtgt gtcacttttg cagaaaagag tcatggattc agaaagg 837



<210> 211 
<211> 837

<212> DNA 

<213> Homo sapiens 



<400> 211 

caaaaattag ctaggtgtgg tggtatgcac ctataatctc agctacccag gaggctgagg 60
caggagaatc acttgaacct ggaggcagag gttgcagtga gccgagacgc accattacac 120
tccagcctgg gtgacagagt gagattccat ctcaaaaaaa aaaaaaaaaa attatgcctt 180
tttgaagcac atacatttta taacatacaa ctgaatccct tattatatta ttagttttga 240
tttaatgttt tcaaaccatc tcccctgata tttctgggag atgggaaaca tgttttctta 300
cacctcttgc attccattct caactcccaa ctgtcttact gcaatgaaca cttaataaga 360
aacagtcaat tggtcaattg attgggcaac aggctaaaca cactcattcc ttgtctgttc 420
ccacttcttt ctttactttc ccttcctgag taacttatcc taaagtcatt aggtgggtgg 480
cagccagatg gtggccacac attaaggtag aaaagagagt gtcatgatgg ttccaagtca 540
gagacctagt agggtgagga tcaagtaggt gttcacatgg agaaacagcc
gtgggagtcc aagcaagcag agaaaatgtc gacacagagg ggtggcctga aaaagcagcc 660
agagcctaaa cagggcatgg agaacatatt tagggcatga ggtgaggagg gcatccatga 720
gtgggaaggg atgggtgagg tttcactaca taaaggggat tgatgaaata agtaaataaa 780
gtatactgga agccaggtgt gtcacttttg cagaaaagag tcatggattc agaaagg 837



<210> 212 
<211> 602 
<212> DNA 

<213> Homo sapiens 
<400> 212 

catagacaag ggtgagtcct tcagtactta gagaaaattc aagagtgact ttaaattccc 60
cacttcaaat atattctctg ttttcttgtc tttcccttaa gacatctctg aatagcttcc 120
ttcaactgcc agtgaaagat agcaggcctg atttcattgg acacaactgt tttcagcccc 180
aattagaggt agggtttatt ctatttaaaa taataatcaa cttgtatttt gtttcctctc 240
ccagggtctc tggaaatttg acacagagtg ctataaaaag tatggaaaaa tgtgggggtg 300
agtattctga aaacctccat tggatagacc tgctactgtg aggaggttac cccactgcag 360
gatagtctct gcccaggtct tcatgggatg aagctcttgt caacctaaat acaaacagag 420
agaggttctc tgaaagaaga ggataattac ttgggagtag aatattgcaa tgggaatctg 480
cttgccgtta taaactatgt gcaaattcag ggaggtaaac aagacaaaga tgctccatag 540
aaaatatgag aagaatctca taactgtttt gagataatta ttgttagcta caaagatcaa 600
ta 602



<210> 213 
<211> 602 
<212> DNA 

<213> Homo sapiens
<400> 213 

catagacaag ggtgagtcct tcagtactta gagaaaattc aagagtgact ttaaattccc 60
cacttcaaat atattctctg ttttcttgtc tttcccttaa gacatctctg aatagcttcc 120
ttcaactgcc agtgaaagat agcaggcctg atttcattgg acgcaactgt tttcagcccc 180
aattagaggt agggtttatt ctatttaaaa taataatcaa cttgtatttt gtttcctctc 240
ccagggtctc tggaaatttg acacagagtg ctataaaaag tatggaaaaa tgtgggggtg 300
agtattctga aaacctccat tggatagacc tgctactgtg aggaggttac cccactgcag 360
gatagtctct gcccaggtct tcatgggatg aagctcttgt caacctaaat acaaacagag 420
agaggttctc tgaaagaaga ggaaaattac ttgggagtag aatattgcaa tgggaatctg 480
cttgccgtta taaactatgt gcaaattcag ggaggtaaac aagacaaaga tgctccatag 540
aaaatatgag aagaatctca taactgtttt gagataatta ttgttagcta caaagatcaa 600
ta 602



<210> 214 

<211> 603 

<212> DNA

<213> Homo sapiens 

<400> 214 

agcggaaaac tcaaggaggt atgaaaataa gatgagtctt aattagaaat gtaaagaatg 60
aatctgggga caggtagaaa gtaagatcac agtccgtttc caaggggtag tccactgagt 120
tcgagcttcc taaaaatggt cttttatctt tatgtacaga aaagacatca caaaattcat 180
tacaaaatgt cacttactgc tccatgctgg agaaagccat atccttctgg gacttgagtc 240
tgcacattta actacaggta ctgatctgtt ttgtgcttag atgttcccca tcactgccca 300
gtatggagat gtattggtga gaaacttgag gcgggaagca gagaaaggca agcctgtcac 360
cttgaaagag taagtaggag cacagccatg gggttctgag ctgtcatgag cccttccagc 420
tgcctgccat ggagtcgaca gtcgcactgt tgggttactc cagtgaccag acaaaagcag 480
ggcagcgctg caactccaaa gagccaccta agagggagtg gctcccatga ggcggcaagt 540
cagcaaggga aaagggcctt ctctcctgtg cacaggagcc aggatttact tatctgttaa 600
ctt 603

<210> 215
<211> 60 
<212> PRT 

<213> Homo sapiens 

<400> 215 

Ser Leu Ala Glu Asp Glu Glu Trp Lys Arg Ile Arg Ser Leu Leu Ser
1               5                   10                  15

Pro Thr Phe Thr Ser Gly Lys Leu Lys Glu Met Phe Pro Ile Thr Ala
            20                  25                  30

Gln Tyr Gly Asp Val Leu Val Arg Asn Leu Arg Arg Glu Ala Glu Lys
        35                  40                  45

Gly Lys Pro Val Thr Leu Lys Asp Ile Phe Gly Ala
    50                  55                  60



<210> 216 
<211> 603 
<212> DNA 
<213> Homo sapiens 



<400> 216 

agcggaaaac tcaaggaggt atgaaaataa gatgagtctt aattagaaat gtaaagaatg 60
aatctgggga caggtagaaa gtaagatcac agtccgtttc caaggggtag tccactgagt 120
tcgagcttcc taaaaatggt cttttatctt tatgtacaga aaagacatca caaaattcat 180
tacaaaatgt cacttactgc tccatgctgg agaaagccat atccttctgg gacttgagtc 240
tgcacattta actacaggta ctgatctgtt ttgtgcttag atgttcccca tcattgccca 300
gtatggagat gtattggtga gaaacttgag gcgggaagca gagaaaggca agcctgtcac 360
cttgaaagag taagtaggag cacagccatg gggttctgag ctgtcatgag cccttccagc 420
tgcctgccat ggagtcgaca gtcacactgt tgggttactc cagtgaccag acaaaagcag 480
ggcagcgctg caactccaaa gagccaccta agagggagtg gctcccatga gacggcaagt 540
cagcaaggga aaagggcctt ctctcctgtg cacaggagcc aggatttact tatctgttaa 600
ctt 603



<210> 217 
<211> 778 
<212> DNA 

<213> Homo sapiens 



<400> 217 

cagagcttac atatcttata tcatccacac tcaacacatg ctactgtagt tgtctgataa 60
tgggtctctg tcttcctatg actgggctcc ttgacctcag aggtgagtct aactcagctt 120
ggtgtctcca tcacccccag catagggcca gctccatcac tggcaccaga taaccacctt 180
ctgagggagt agatggaaga tgattcagca gatagttctg aaagtctgtg gctctttatg 240
tgtcttgact ggatatgtgg gtttcttgct gcatgtatag tggaaggacg gtaagaggtg 300
ctgattttaa ttttgcatat ctttctccac tcagcatctt tggggcctac agcatggatg 360
tgattactgg cacatcattt ggagtgaaca tcgactctct caacaatcca caagacccct 420
ttgtggagag cactaaaaag ttcctaaaat ttggtttctt agatccatta tttctctcaa 480
taagtatgtg ggctattatt tctttctctc tttttaaaaa taactgcttt cttgacatat 540
aattcacata tcgtataatt catccactta aaaggtacaa ttccattgtt tttaagataa 600
tcaaaaatat gtatgaccat tactattgta aactaaaatg tttttgtcaa tctagagccc 660
tcacacactt tagctgtcaa caccccacca caaaccccac tgccctaagc atccaataat 720
caactttctg cctctataga tttgcctatt ctggacactt catagaaata atatcatt 778



<210> 218 
<211> 778 
<212> DNA 
<213> Homo sapiens
<400> 218 

cagagcttac atatcttata tcatccacac tcaacacatg ctactgtagt tgtctgataa 60 
tgggtctctg tcttcctatg actgggctcc ttgacctcag aggtgagtct aactcagctt 120 
ggtgtctcca tcacccccag catagggcca gctccatcac tggcaccaga taaccacctt 180 
ctgagggagt agatggaaga tgattcagca gatagttctg aaagtctgtg gctctttatg 240 
tgtcttgact ggatatgtgg gtttcttgct gcatgtatag tggaaggacg gtaagaggtg 300
ctgattttaa ttttccatat ctttctccac tcagcatctt tggggcctac agcatggatg 360
tgattactgg cacatcattt ggagtgaaca tcgactctct caacaatcca caagacccct 420 
ttgtggagag cactaagaag ttcctaaaat ttggtttctt agatccgtta tttctctcaa 480 
taagtatgtg ggctattatt tctttctctc tttttaaaaa taactgcttt cttgacatat 540 
aattcacata tcgtataatt catccactta aaaggtacaa ttccattgtt tttaagataa 600 
tcaaaaatat gtatgaccat tactattgta aactaaaatg tttttgtcaa tctagagccc 660 
tcacacactt tagctgtcaa caccccacca caaaccccac tgccctaagc atccaataat 720 
caactttctg cctctataga tttgcctatt ctggacactt catagaaata atatcatt 778 



<210> 219 
<211> 778 
<212> DNA 
<213> Homo sapiens 

<400> 219 

cagagcttac atatcttata tcatccacac tcaacacatg ctactgtagt tgtctgataa 60 
tgggtctctg tcttcctatg actgggctcc ttgacctcag aggtgagtct aactcagctt 120
ggtgtctcca tcacccccag catagggcca gctccatcac tggcaccaga taaccacctt 180 
ctgagggagt agatggaaga tgattcagca gatagttctg aaagtctgtg gctctttatg 240 
tgtcttgact ggatatgtgg gtttcttgct gcatgtatag tggaaggacg gtaagaggtg 300 
ctgattttaa ttttccatat ctttctccac tcagcatctt tggggcctac agcatggatg 360 
tgattactgg cacatcattt ggagtgaaca tcgactctct caacaatcca caagacccct 420
ttgtggagag cactaagaag ttcctaaaat ttggtttctt agatccatta tttctctcaa 480 
taagtatgtg ggctattatt tctttctctc tttttaaaaa taactgcttt cttgacatat 540 
aattcacata tcgtataatt catccactta aaaggtacaa tttcattgtt tttaagataa 600 
tcaaaaatat gtatgaccat tactattgta aactaaaatg tttttgtcaa tctagagccc 660 
tcacacactt tagctgtcaa caccccacca caaaccccac tgccctaagc atccaataat 720 
caactttctg cctctataga tttgcctatt ctggacactt catagaaata atatcatt 778 



<210> 220 
<211> 778 
<212> DNA 
<213> Homo sapiens 

<400> 220 

cagagcttac atatcttata tcatccacac tcaacacatg ctactgtagt tgtctgataa 60 
tgggtctctg tcttcctatg actgggctcc ttgacctcag aggtgagtct aactcagctt 120 
ggtgtctcca tcacccccag catagggcca gctccatcac tggcaccaga taaccacctt 180
ctgagggagt agatggaaga tgattcagca gatagttctg aaagtctgtg gctctttatg 240
tgtcttgact ggatatgtgg gtttcttgct gcatgtatag tggaaggacg gtaagaggtg 300 
ctgattttaa ttttccatat ctttctccac tcagcatctt tggggcctac agcatggatg 360 
tgattactgg cacatcattt ggagtgaaca tcgactctct caacaatcca caagacccct 420
ttgtggagag cactaagaag ttcctaaaat ttggtttctt agatccatta tttctctcaa 480 
taagtatgtg ggctattatt tctttctctc tttttaaaaa taactgcttt cttgacatat 540 
aattcacata tcgtataatt catccactta aaaggtacaa ttccattgtt tttaagataa 600 
tcaaaaatat gtatgaccat tactattgta aactaaaatg tttttgtcag tctagagccc 660 
tcacacactt tagctgtcaa caccccacca caaaccccac tgccctaagc atccaataat 720 
caactttctg cctctataga tttgcctatt ctggacactt catagaaata atatcatt 778



<210> 221 
<211> 670 
<212> DNA 
<213> Homo sapiens



<400> 221 

agaaggtgcc attgatctca ctgctgtagt ggtgtttcct atgtatagac ctgcccttgc 60
tcagtcgccg gcctgaaaga agggcaaaca tgataaaagg aatgggttcc agttgagaat 120
catgatgttc ttattcttat tactggtaga gaaaattata attgctccag gtaaagtttg 180
cattttcaat gatttccttt tgttcgtttt gtttttccca cagtactctt tccattcctt 240
accccagttt ttgaagcatt aaatgtctct ctgtttccaa aagataccat aaatttttta 300
agtaaatctg taaacagaat gaagaaaagt cgcctcaacg acaaacaaaa ggtaaaatct 360
gatggtggtt aaatgacgat gtttaggttt tgataaattt agattttata cacatgatag 420
agcatgtatc tgtattttta aaaataaaga cagagaactt atgtttagaa caagagaagc 480
catttggtag aaataaagaa ggagattggg gaaggagatg agaatgagtc agagagatag 540
catttaaaac ttgaaatcag gcacaacaat tagtatgtca tgatataaac agtattgaga 600
taaaatttta ccacttctct tccctttaat aaattgtcaa aggataaagt ttcctgtttg 660
aaaatatatt 670



<210> 222 
<211> 670 
<212> DNA 
<213> Homo sapiens 



<400> 222 

agaaggtgcc attgatctca ctgctgtagt ggtgtttcct atgtatagac ctgcccttgc 60
tcagtcgccg gcctgaaaga agggcaaaca tgataaaagg aatgggttcc agttgagaat 120
catgatgttc ttattcttat tactggtaga gaaaattata attgctccag gtaaagtttg 180
cattttcaat gatttccttt tgtttgtttt gtttttccca cagtactctt tccattcctt 240
accccagttt ttgaagcatt aaatgtctct ctgtttccaa aagataccat aaatttttta 300
agtaaatctg taaacagaat gaagaaaagt cgcctcaacg acaaacaaaa ggtaaaatct 360
gatggtggtt aaatgacgat gtttaggttt tgataaattt agattttata cacatgatag 420
agcatgtatc tgtattttta aaaataaaga cagagaactt atgtttagaa caagagaagc 480
catttggtag aaatacagaa ggagattggg gaaggagatg agaatgagtc agagagatag 540
catttaaaac ttgaaatcag gcacaacaat tagtatgtca tgatataaac agtattgaga 600
taaaatttta ccacttctct tccctttaat aaattgtcaa aggataaagt ttcctgtttg 660
aaaatatatt 670



<210> 223 
<211> 826 
<212> DNA 
<213> Homo sapiens 



<400> 223 

agataaagta cttttaggat cattcaaggc acacacccat aacactgagt atgtaagaca 60
gaaatgctct ctctggaaat tacagcagtg ctggtgctgg gatgccatga tgaggagtgt 120
gtggcccaca atcatgtaga ccttgggaaa acctggatta aaatgatttt gcatcatcct 180
ggccctgtat aagatacata tcagaatgaa aaccactccc agtgtgactt tgaattgctt 240
ttccattttt tcttcttggg attagagagc ttcacttaga tttcatctaa gctgtgatgt 300
tgtacgttga cctgatttac ctaaaatgtc tttcctctcc tttcagctct gtctgatctg 360
gagctcgcag cccagtcaat aatcttcatt tttgctggct atgaaaccac cagcagtgtt 420
ctttccttca ctttatatga actggccact caccctgatg tccagcagaa actgcaaaag 480
gagattgatg cagttttgcc caataaggtg aggggatgac ccctggagat gaagggaaga 540
ggtgaagcct tagcaaaaat gcctcctcac cactccccag gagaattttt ataaaaagca 600
taatcactga ttccttcact gacataatgt aggaagcctc tgaggagaaa aacaaaggga 660
gaaacataga gaacggttgc tactggcaga agcataagat ctttgtacaa tattgctggc 720
cctggttcac ctgtttactg ttatcacaat aatgctaagt aaaaaaaaaa aaaaaaaaaa 780
aaaaaaaaaa aggagtgtgg cgagaagatg gccaaacagg aacagc 826



<210> 224 
<211> 826 
<212> DNA 
<213> Homo sapiens 
<400> 224

agataaagta cttttaggat cattcaaggc acacacccat aacactgagt atgtaagaca 60
gaaatgctct ctctggaaat tacagcagtg ctggtgctgg gatgccatga tgaggagtgt 120
gtggcccaca atcatgtaga ccttgggaaa acctggatta aaatgatttt gcgtcatcct 180
ggccctgtat aagatacata tcagaatgaa aaccactccc agtgtgactt tgaattgctt 240
ttccattttt tcttcttggg attagagagc ttcacttaga tttcatctaa gctgtgatgt 300
tgtacgttga cttgatttac ctaaaatgtc tttcctctcc tttcagctct gtctgatctg 360
gagctcgcag cccagtcaat aatcttcatt tttgctggct atgaaaccac cagcagtgtt 420
ctttccttca ctttatatga actggccact caccctgatg tccagcagaa actgcaaaag 480
gagattgatg cagttttgcc caataaggtg aggggatgac ccctggagat gaagggaaga 540
ggtgaagcct tagcaaaaat gcctcctcac cactccccag gagaattttt ataaaaagca 600
taatcactga ttccttcact gacataatgt aggaagcctc tgaggagaaa aacaaaggga 660
gaaacataga gaacggttgc tactggcaga agcataagat ctttgtacaa tattgctggc 720
cctggttcac ctgtttactg ttatcacaat aatgctaagt aaaaaaaaaa aaaaaaaaaa 780
aaaaaaaaaa aggagtgtgg cgagaagatg gccaaacagg aacagc 826



<210> 225 
<211> 616 
<212> DNA 
<213> Homo sapiens 

<400> 225 

caggcctggc acagagtcag tgctccataa atattttgtt aaacgatgga tggtgagtgc 60
ttttactatc cagtatttac ccagcttata gattaagtat gaagagttca agatacatgg 120
tgttaagagt cgtttttata tgcttgcaaa gcatttttgt catatttttt ctactttgct 180
tccatctttt cttctttcac ttcatttatt aattctccat atgcttgttt aactattgca 240
gatccccttg aaattagaca cgcaaggact tcttcaacca gaaaaaccca ttgttctaaa 300
ggtggattca agagatggaa ccctaagtgg agaatgagtt attctaagga tttctacttt 360
ggtcttcaag aaagctgtgc cccagaacac cagagatttc aacttagtca ataaaacctt 420
gaaataaaga tgggcttaat ctaatgtact gcatgagtag ttggtgattt tgtacattca 480
ttgagctctc ccagagtctg tgtagagtgt tgtgcattat gtagtataaa ggaggtgacc 540
aggtaagtga cagataggta gactcagctt ctctgcttct cataggacta cctctaccca 600
cctctagtta gcatta 616



<210> 226 
<211> 624 
<212> DNA 
<213> Homo sapiens 

<400> 226 

gattaagctt ttcatgattc ctcatagaac atgaactcaa aagaggtcag caaaggggtg 60 
tgtgcgattc tttgctattg gctgcagcta tagccctgcc tccttctcca gcacataaat 120
ctttcagcag cttggctgaa gactgctgtg cagggcaggg aagctccagg caaacagccc 180 
agcaaacagc agcactcagc taaaaggaag actcacagaa cacagttgaa gaaggaaagt 240 
ggcgatggac ctcatcccaa atttggcggt ggaaacctgg cttctcctgg ctgtcagcct 300 
ggtgctcctc tatctgtgag taactgtcca aactcctctc tttgtttcct tggacttggg 360 
gtgctaatcg ggcccctttc cccttatctg ttttgaagat caaaagagat gttcaaggag 420 
aagtagctga agtgttggac gctacaaacg catagaagtt attattatct tatgcagatc 480 
tatgaatgaa taaataagca tttctcccat ccaccttcta attttggtga ctaggagggt 540 
ttagggacag catttggtag tgggaatgat ttgattagct tagatctgac gaagactaat 600 
caatgaaaac atggcagcgg caga 624 



<210> 227 
<211> 624 
<212> DNA 
<213> Homo sapiens 

<400> 227 

gattaagctt ttcatgattc ctcatagaac atgaactcaa aagaggtcag caaaggggtg 60
tgtgcgattc tttgctattg gctgcagcta tagccctgcc tccttctcca

ctttcagcag cttggctgaa gactgctgtg cagggcaggg aagctccagg caaacagccc 180 
agcaaacagc agcactcagc taaaaggaag actcacagaa cacagttgaa gaaggaaagt 240 
ggcgatggac ctcatcccaa atttggcggt ggaaacctgg cttctcctgg ctgtcagcct 300 
ggtgctcctc tatctgtgag taactgtcca aactcctctc tttgtttcct tggacttggg 360 
gtgctaatcg ggcccctttt cccttatctg ttttgaagat caaaagagat gttcaaggag 420
aagtagctga agtgttggac gctacaaacg catagaagtt attattatct tatacagatc 480
tatgaatgaa taaataagca tttctcccat ccaccttcta attttggtga ctaggagggt 540 
ttagggacag catttggtag tgggaatgat ttgattagct tagatctgac gaagactaat 600 
caatgaaaac atggcagcgg caga 624 



<210> 228 
<211> 626 
<212> DNA 
<213> Homo sapiens 



<400> 228 
gaacagttcc ctaccacgtg gagcatttgc aattaaaagg agactgagat atagaggcag 60
gagaccacac cagatggctg ggtctcccca ctcccacccc cgccccacat acactcagaa 120
gaggctaggc atctaggatc tccattgagc atcttgaata tggcttgcca taatatcata 180
tacagtcaat aaatatttgt taaataagga tgcctcttca atatattttg tgcaaccatg 240
aagatcacca caactaatgt gagaaaaaat gtttctgttg aactctagtc tttaggccca 300
gtgggattta tgaaaagtgc catctcttta gctgaggatg aagaatggaa gagaatacag 360
tcattgctgt ctccaacctt caccagcgga aaactcaagg aggtatgaaa ataagatgag 420
tcttaattag aaatgtaaag aatgaatctg gggacaggta gaaagtaaga tcacagtccg 480
tttccaaggg gtagtccact gagttcgagc ttcctaaaaa tggtctttta tctttatgta 540
cagaaaagac atcacaaaat tcattacaaa atgtcactta ctgctccatg ctggagaaag 600
ccatatcctt ctgggacttg agtctg 626



<210> 229 
<211> 60 

<212> PRT 

<213> Homo sapiens 



<400> 229 

Ser Val Phe Thr Asn Arg Arg Ser Leu Gly Pro Val Gly Phe Met Lys
1               5                   10                  15

Ser Ala Ile Ser Leu Ala Glu Asp Glu Glu Trp Lys Arg Ile Gln Ser
            20                  25                  30

Leu Leu Ser Pro Thr Phe Thr Ser Gly Lys Leu Lys Glu Met Phe Pro
        35                  40                  45

Ile Ile Ala Gln Tyr Gly Asp Val Leu Val Arg Asn
    50                  55                  60



<210> 230 
<211> 626 
<212> DNA 
<213> Homo sapiens 

<400> 230 

gaacagttcc ctaccacgtg gagcatttgc aattaaaagg agactgagat atagaggcag 60 
gagaccacac cagatggctg ggtctcccca ctcccacccc cgccccacat acactcagaa 120 
gaggctaggc atctaggatc tccattgagc atcttgaata tggcttgcca taatatcata 180 
tacagtcaat aaatatttgt taaataagga tgcctcttca atatattttg tgcaaccatg 240 
aagatcacca caactaatgt gagaaaaaat gtttctgttg aactctagtc tttaggccca 300 
gtgggattta tgaaaagtgc catctcttta gctgaggatg aagaatggaa gagaatacgg 360 
tcattgctgt ctccaacctt caccagcgga aaactcaagg aggcatgaaa ataagatgag 420
tcttaattag aaatgtaaag aatgaatctg gggacaggta gaaagtaaga tcacagtccg 480
tttccaaggg gtagtccact gagttcgagc ttcctaaaaa tggtctttta tctttatgta 540
cagaaaagac atcacaaaat tcattacaaa atgtcactta ctgctccatg ctggagaaag 600
ccatatcctt ctgggacttg agtctg 626



<210> 231 
<211> 626 
<212> DNA 
<213> Homo sapiens 



<400> 231 

gaacagttcc ctaccacgtg gagcatttgc aattaaaagg agactgagat atagaggcag 60
gagaccacac cagatggctg ggtctcccca ctcccacccc cgccccacat acactcagaa 120
gaggctaggc atctaggatc tccattgagc atcttgaata tggcttgcca taatatcata 180
tacagtcaat aaatatttgt taaataagga tgcctcttca atatattttg tgcaaccatg 240
aagatcacca caactaatgt gagaaaaaat gtttctgttg aactctagtc tttaggccca 300
gtgggattta tgaaaagtgc catctcttta gctgaggatg aagaatggaa gagaatacgg 360
tcattgctgt ctccaacctt caccagcgga aaactcaagg aggtatgaaa ataagatgag 420
tcttaattag aaatgtaaag aatgaatctg gggacaggta gaaagtaaga tcacagtcca 480
tttccaaggg gtagtccact gagttcgagc ttcctaaaaa tggtctttta tctttatgta 540
cagaaaagac atcacaaaat tcattacaaa atgtcactta ctgctccatg ctggagaaag 600
ccatatcctt ctgggacttg agtctg 626



<210> 232 
<211> 623 
<212> DNA 

<213> Homo sapiens 

<400> 232 

attggacatg atagctagat ttgtttcagg aaaacatcct gctttccaag gatttagatg 60
aatgtttttg ttcactggtg actcaggtaa cacgtcttca agaagccata gggaggttga 120
gggagggaag tcaagaaggg aggttgagga ctgcactttt gatttacttc tgacttcacg 180
agtcactttc tgcgaaagaa atctctcctt ttgcttctag caccgactag atttccttca 240
gctgatgatt gactcccaga attcgaaaga aactgagtcc cacaaaggta accaaggagt 300
gcttctgagg gctactggcg gggacactaa gagggagggc cttgttctga aaatgtgcag 360
gaagtattcc aggaagatga gaatttttgc cacatagcag aacaacacac atttagatgt 420
tataaatggt agctggaggc actttccaga agcccacagg tatagccatg ttccaggctg 480
aaagggcaac cctaagcaaa cctagaatgc ttggaggaca gtcagtggtt tgtggatcac 540
ctacatgaga tcaaatgcca gttctcagcc tcctccagat ccaccaagtg agaacctcta 600
cttggaaatt tatatcaaac ata 623



<210> 233 
<211> 616 
<212> DNA 
<213> Homo sapiens 

<400> 233 

caggcctggc acagagtcag tgctccataa atattttgtt aaacgatgga tggtgagtgc 60
ttttactatc cagtatttac ccagcttata gattaagtat gaagagttca agatacatgg 120
tgttaagagt cgtttttata tgcttgcaaa gcatttttgt catatttttt ctactttgct 180
tccatctttt cttctttcac ttcatttatt aattctccat atgcttgttt aactattgta 240
gatccccttg aaattagaca cgcaaggact tcttcaacca gaaaaaccca ctgttctaaa 300
ggtggattca agagatggaa ccctaagtgg agaatgagtt attctaagga tttctacttt 360
ggtcttcaag aaagctgtgc cccagaacac cagagatttc aacttagtca ataaaacctt 420
gaaataaaga tgggcttaat ctaatgtact gcatgagtag ttggtgattt tgtacattca 480
ttgagctctc ccagagtctg tgtagagtgt tgtgcattat gtagtataaa ggaggtgacc 540
aggtaagtga cagataggta gactcagctt ctctgcttct cataggacta cctctaccca 600
cctctagtta gcatta 616

<210> 234

<211> 59 

<212> PRT 

<213> Homo sapiens 

<400> 234 

Met Arg Phe Ala Leu Met Asn Met Lys Leu Ala Leu Ile Arg Val Leu
1               5                   10                  15

Gln Asn Phe Ser Phe Lys Pro Cys Lys Glu Thr Gln Ile Pro Leu Lys
            20                  25                  30

Leu Asp Thr Gln Gly Leu Leu Gln Pro Glu Lys Pro Thr Val Leu Lys
        35                  40                  45

Val Asp Ser Arg Asp Gly Thr Leu Ser Gly Glu
    50                  55



<210> 235 
<211> 830 
<212> DNA 

<213> Homo sapiens 

<400> 235 

tggtcaccca ccatgtgtac agtaccctgc tagggtccag ggtcatgaaa gtaaataata 60
ccagactgtg cccttgagga actcacctct gctaagggaa acaggcacag aaacccacaa 120
gggtggtaga gaggaaatag gacaatagga ctgtgtgagg gggataggag gcacccagag 180
gaggaaatgg ttacatctgt gtgaggaggt tggtaaggaa agactttaat agaaggggtc 240
tgtctggctg ggcttgcaag gatgtgtagg agtcatctag ggggcacaag tacactccag 300
gcagagggaa ttgcatgggt aaagatctgc agttgtggct tgtggggatg gatttcaagt 360
attctggaat gaagacagcc atggaaacaa gggcaggtga gaggatattt aagaggcttc 420
atgccaatgg ctccacttca gtttctgata agaactcagg ttccgtggac tccctgataa 480
aactgattaa gttgtttatg attccccata gaatatgaac tcaaaggagg taagcaaagg 540
ggtgtgtgcg attctttgct accggctgca gctgcagccc cacctccttc tccagcacat 600
aaacatttca gcagcttgac ctaagactgc tgtgcagggc agggatgctc caggcagaca 660
gcccagcaaa caacagcaca cagctgaaag taagactcag aggagacagt tgaagaaggc 720
aagtggcgat ggacctcatc ccaaatttgg cggtggaaac ctggcttctc ctggctgtca 780
gcctggtgct cctctatctg tcagtaactg tccagattcc tctcctctgt 830



<210> 236 
<211> 775 
<212> DNA 
<213> Homo sapiens 



<400> 236 

ggagtgacct gattttccag gtgctgtctg tcaccccttt ctttgactag gaaagggaac 60
tccttgaccc cttgcgcttc tcaagtgagg caatgcctcg ccctgcttcg gcttgtgcac 120
agcacgctgc acccactgtc ctgcacccac tgtctggcac tccctagtga gatgaacccg 180
gtacctcaga tgaaaatgca gaaattaccc gtcttctgtg tcactcacac tgggagctgt 240
agaccggagc tgttcctatt cg^ccatctt ggctccaccc cccgagtttt ggcttttaat 300
tgaaagtgta ttgatgtggg aaggagataa tgccatgcat ttatgagcac atattagagg 360
gtctgagaca atgcatgtga taaaaggtca cctaaggaag aaaaaaagaa caagggaaga 420
cactggaaag aacgtgatgc tgggagtccc tgggccacca aagtctggag aaaagtggta 480
accacaaggc tcccagccta gtttcactga agacctcgac actaggtgac ttatgggatc 540
cttggtagga caagccttga aaagttttac aatagcaaat gtggacgttg tcagaaccaa 600
atgatgtcac gtgtgtattt gtgtgtgtgt gtcagtgtgt gtgtttaaaa atcatgacaa 660
ataaagcagg ctgtgaagag gggattccca tgctcgtgtg cctgataaca caactatcac 720
aaacgctttg cgaaaccaca agtttgcaca aaggcaatcc caaccttaca caaaa 775



<210> 237 
<211> 25
<212> DNA
<213> Homo sapiens

<400> 237

ttgttgggaa atgttttgtc ctatc 25



<210> 238 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 238 

acagggagtt gaccttcata cgtt 24 



<210> 239 

<211> 33 

<212> DNA 

<213> Homo sapiens 

<400> 239 

tcagggtctc tggaaatttg acacagagtg cta 33